Optimizing glycan processing in plants

ABSTRACT

The invention is directed to methods for optimizing glycan processing in organisms (and in particular, plants) so that a glycoprotein having complex type bi-antennary glycans and thus 5 containing galactose residues on both arms and which are devoid of (or reduce in) xylose and fucose can be obtained. The invention is further directed to said glycoprotein obtained and host system comprising said protein.

FIELD OF THE INVENTION

The invention is directed to methods for optimizing glycan processing ofcell or an organism containing glycoproteins with N-glycans, inparticular plants so that a glycoprotein having an N-glycan, highmannose type, hybrid or preferably complex type N-glycans, including butnot limited to bi-antennary N-glycans, and containing a galactoseresidue on at least one arm of the N-glycan and which are devoid of (orreduced in) xylose and fucose residues can be obtained. The invention isfurther directed to said glycoprotein obtained and in particular a planthost system comprising said protein.

BACKGROUND OF THE INVENTION

N-linked glycans, specific oligosaccharide structures attached toasparagine residues of glycoproteins, can contribute significantly tothe properties of the protein and, in turn, to the properties of theorganism. Plant proteins can carry N-linked glycans but in markedcontrast to mammals only few biological processes are known to whichthey contribute.

Biogenesis of N-linked glycans begins with the synthesis of a lipidlinked oligosaccharide moiety (Glc3Man9GlcNAc2-) which is transferred enbloc to the nascent polypeptide chain in the endoplasmic reticulum (ER).Through a series of trimming reactions by exoglycosidases in the ER andcis-Golgi compartments, the so-called “high mannose” (Man9GlcNAc2 toMan5GlcNAc2) glycans are formed. Subsequently, the formation of complextype glycans starts with the transfer of the first GlcNAc ontoMan5GlcNAc2 by GnTI and further trimming by mannosidase II (ManII) toform GlcNAcMan3GlcNAc2. Complex glycan biosynthesis continues while theglycoprotein is progressing through the secretory pathway with thetransfer in the Golgi apparatus of the second GlcNAc residue by GnTII aswell as other monosaccharide residues onto the GlcNAcMan3GlcNAc2 underthe action of several other glycosyl transferases.

Plants and mammals differ with respect to the formation of complexglycans (see FIG. 1, which compares the glycosylation pathway ofglycoproteins in plants and mammals). In plants, complex glycans arecharacterized by the presence of β(1,2)-xylose residues linked to theMan-3 and/or an α(1,3)-fucose residue linked to GlcNAc-1, instead of anat(l,6)-fucose residue linked to the GlcNAc-1. Genes encoding thecorresponding xylosyl (XylT) and fucosyl (FucT) transferases have beenisolated [Strasser et al., “Molecular cloning and functional expressionof beta1,2-xylosyltransferase cDNA from Arabidopsis thaliana,” FEBSLett. 472:105 (2000); Leiter et al., “Purification, cDNA cloning, andexpression of GDP-L-Fuc:Asn-linked GlcNAc alpha 1,3-fucosyltransferasefrom mung beans,” J. Biol. Chem. 274:21830 (1999)]. Plants do notpossess β(1,4)-galactosyltransferases nor α(2,6)sialyltransferases andconsequently plant glycans lack the β(1,4)-galactose and terminalα(2,6)NeuAc residues often found on mammalian glycans.

The final glycan structures are not only determined by the mere presenceof enzymes involved in their biosynthesis and transport but to a largeextent by the specific sequence of the various enzymatic reactions. Thelatter is controlled by discrete sequestering and relative position ofthese enzymes throughout the ER and Golgi, which is mediated by theinteraction of determinants of the transferase and specificcharacteristics of the sub-Golgi compartment for which the transferaseis destined. A number of studies using hybrid molecules have identifiedthat the transmembrane domains of several glycosyltransferases,including that of β(1,4)galactosyltransferases, play a central role intheir sub-Golgi sorting [Grabenhorst et al., J. Biol. Chem 274:36107(1999); Colley, K., Glycobiology 7:1 (1997); Munro, S., Trends CellBiol. 8:11 (1998); Gleeson, P. A., Histochem. Cell Biol. 109:517(1998)].

Although plants and mammals have diverged a relatively long time ago,N-linked glycosylation seems at least partly conserved. This isevidenced by the similar though not identical glycan structures and bythe observation that a mammalian GlcNAcTI gene complements a Arabidopsismutant that is deficient in GlcNAcTI activity, and vice versa. Thedifferences in glycan structures can have important consequences. Forexample, xylose and α(1,3)-fucose epitopes are known to be highlyimmunogenic and possibly allergenic in some circumstances, which maypose a problem when plants are used for the production of therapeuticglycoproteins. Moreover, blood serum of many allergy patients containsIgE directed against these epitopes but also 50% of non-allergic blooddonors contains in their sera antibodies specific for core-xylosewhereas 25% have antibodies for core-alpha 1,3-fucose (Bardor et al.,2002, in press, Glycobiology) (Advance Access published Dec. 17, 2002)which make these individuals at risk to treatments with recombinantproteins produced in plants containing fucose and/or xylose. Inaddition, this carbohydrate directed IgE in sera might cause falsepositive reaction in in vitro tests using plant extracts since there isevidence that these carbohydrate specific IgE's are not relevant for theallergenic reaction. In sum, a therapeutic failure with a glycoproteinproduced in plants might be the result of accelerated clearance of therecombinant glycoprotein having xylose and/or fucose.

Accordingly, there is a need to better control glycosylation in plants,and particularly, glycosylation of glycoproteins intended fortherapeutic use.

DEFINITIONS

To facilitate understanding of the invention, a number of terms as usedin this specification are defined below.

The term “vector” refers to any genetic element, such as a plasmid,phage, transposon, cosmid, chromosome, retrovirus, virion, or similargenetic element, which is capable of replication when associated withthe proper control elements and which can transfer gene sequences intocells and/or between cells. Thus, this term includes cloning andexpression vehicles, as well as viral vectors.

The term “expression vector” as used herein refers to a recombinant DNAmolecule containing a desired coding sequence (or coding sequences)—suchas the coding sequence(s) for the hybrid enzyme(s) described in moredetail below—and appropriate nucleic acid sequences necessary for theexpression of the operably linked coding sequence in a particular hostcell or organism. Nucleic acid sequences necessary for expression inprokaryotes usually include a promoter, an operator (optional), and aribosome binding site, often along with other sequences. Eukaryoticcells are known to utilize promoters, enhancers, and termination andpolyadenylation signals. It is not intended that the present inventionbe limited to particular expression vectors or expression vectors withparticular elements.

The term “transgenic” when used in reference to a cell refers to a cellwhich contains a transgene, or whose genome has been altered by theintroduction of a transgene. The term “transgenic” when used inreference to a cell, tissue or to a plant refers to a cell, tissue orplant, respectively, which comprises a transgene, where one or morecells of the tissue contain a transgene (such as a gene encoding thehybrid enzyme(s) of the present invention), or a plant whose genome hasbeen altered by the introduction of a transgene. Transgenic cells,tissues and plants may be produced by several methods including theintroduction of a “transgene” comprising nucleic acid (usually DNA) intoa target cell or integration of the transgene into a chromosome of atarget cell by way of human intervention, such as by the methodsdescribed herein.

The term “transgene” as used herein refers to any nucleic acid sequencewhich is introduced into the genome of a cell by experimentalmanipulations. A transgene may be an “endogenous DNA sequence,” or a“heterologous DNA sequence” (i.e., “foreign DNA”). The term “endogenousDNA sequence” refers to a nucleotide sequence which is naturally foundin the cell into which it is introduced so long as it does not containsome modification (e.g., a point mutation, the presence of a selectablemarker gene, or other like modifications) relative to thenaturally-occurring sequence. The term “heterologous DNA sequence”refers to a nucleotide sequence which is ligated to, or is manipulatedto become ligated to, a nucleic acid sequence to which it is not ligatedin nature, or to which it is ligated at a different location in nature.Heterologous DNA is not endogenous to the cell into which it isintroduced, but has been obtained from another cell. Heterologous DNAalso includes an endogenous DNA sequence which contains somemodification. Generally, although not necessarily, heterologous DNAencodes RNA and proteins that are not normally produced by the cell intowhich it is expressed. Examples of heterologous DNA include reportergenes, transcriptional and translational regulatory sequences,selectable marker proteins (e.g., proteins which confer drugresistance), or other similar elements.

The term “foreign gene” refers to any nucleic acid (e.g., gene sequence)which is introduced into the genome of a cell by experimentalmanipulations and may include gene sequences found in that cell so longas the introduced gene contains some modification (e.g., a pointmutation, the presence of a selectable marker gene, or other likemodifications) relative to the naturally-occurring gene.

The term “fusion protein” refers to a protein wherein at least one partor portion is from a first protein and another part or portion is from asecond protein. The term “hybrid enzyme” refers to a fusion proteinwhich is a functional enzyme, wherein at least one part or portion isfrom a first species and another part or portion is from a secondspecies. Preferred hybrid enzymes of the present invention arefunctional glycosyltransferases (or portions thereof) wherein at leastone part or portion is from a plant and another part or portion is froma mammal (such as human).

The term “introduction into a cell” or “introduction into a host cell”in the context of nucleic acid (e.g., vectors) is intended to includewhat the art calls “transformation” or “transfection” or “transduction.”Transformation of a cell may be stable or transient—and the presentinvention contemplates introduction of vectors under conditions where,on the one hand, there is stable expression, and on the other hand,where there is only transient expression. The term “transienttransformation” or “transiently transformed” refers to the introductionof one or more transgenes into a cell in the absence of integration ofthe transgene into the host cell's genome. Transient transformation maybe detected by, for example, enzyme-linked immunosorbent assay (ELISA)which detects the presence of a polypeptide encoded by one or more ofthe transgenes. Alternatively, transient transformation may be detectedby detecting the activity of the protein (e.g., antigen binding of anantibody) encoded by the transgene (e.g., the antibody gene). The term“transient transformant” refers to a cell which has transientlyincorporated one or more transgenes. In contrast, the term “stabletransformation” or “stably transformed” refers to the introduction andintegration of one or more transgenes into the genome of a cell. Stabletransformation of a cell may be detected by Southern blot hybridizationof genomic DNA of the-cell with nucleic acid sequences which are capableof binding to one or more of the transgenes. Alternatively, stabletransformation of a cell may also be detected by the polymerase chainreaction (PCR) of genomic DNA of the cell to amplify transgenesequences. The term “stable transformant” refers to a cell which hasstably integrated one or more transgenes into the genomic DNA. Thus, astable transformant is distinguished from a transient transformant inthat, whereas genomic DNA from the stable transformant contains one ormore transgenes, genomic DNA from the transient transformant does notcontain a transgene.

The term “host cell” includes both mammalian (e.g. human B cell clones,Chinese hamster ovary cells, hepatocytes) and non-mammalian cells (e.g.insect cells, bacterial cells, plant cells). In one embodiment, the hostcells are mammalian cells and the introduction of a vector expressing ahybrid protein of the present invention (e.g TmGnTII-GalT) inhibits (orat least reduces) fucosylation in said mammalian cells.

The term “nucleotide sequence of interest” refers to any nucleotidesequence, the manipulation of which may be deemed desirable for anyreason (e.g., confer improved qualities, use for production oftherapeutic proteins), by one of ordinary skill in the art. Suchnucleotide sequences include, but are not limited to, coding sequencesof structural genes (e.g., reporter genes, selection marker genes,oncogenes, antibody genes, drug resistance genes, growth factors, andother like genes), and non-coding regulatory sequences which do notencode an mRNA or protein product, (e.g., promoter sequence,polyadenylation sequence, termination sequence, enhancer sequence, andother like sequences). The present invention contemplates host cellsexpressing a heterologous protein encoded by a nucleotide sequence ofinterest along with one or more hybrid enzymes.

The term “isolated” when used in relation to a nucleic acid, as in “anisolated nucleic acid sequence” refers to a nucleic acid sequence thatis identified and separated from one or more other components (e.g.,separated from a cell containing the nucleic acid, or separated from atleast one contaminant nucleic acid, or separated from one or moreproteins, one or more lipids) with which it is ordinarily associated inits natural source. Isolated nucleic acid is nucleic acid present in aform or setting that is different from that in which it is found innature. In contrast, non-isolated nucleic acids are nucleic acids suchas DNA and RNA which are found in the state they exist in nature. Forexample, a given DNA sequence (e.g., a gene) is found on the host cellchromosome in proximity to neighboring genes; RNA sequences, such as aspecific mRNA sequence encoding a specific protein, are found in thecell as a mixture with numerous other mRNAs which encode a multitude ofproteins. However, an isolated nucleic acid sequence comprising SEQ IDNO:1 includes, by way of example, such nucleic acid sequences in cellswhich ordinarily contain SEQ ID NO:1 where the nucleic acid sequence isin a chromosomal or extrachromosomal location different from that ofnatural cells, or is otherwise flanked by a different nucleic acidsequence than that found in nature. The isolated nucleic acid sequencemay be present in single-stranded or double-stranded form. When anisolated nucleic acid sequence is to be utilized to express a protein,the nucleic acid sequence will contain at a minimum at least a portionof the sense or coding strand (i.e., the nucleic acid sequence may besingle-stranded). Alternatively, it may contain both the sense andanti-sense strands (i.e., the nucleic acid sequence may bedouble-stranded).

As used herein, the term “purified” refers to molecules, either nucleicor amino acid sequences, that are removed from their naturalenvironment, isolated or separated. An “isolated nucleic acid sequence”is therefore a purified nucleic acid sequence. “Substantially purified”molecules are at least 60% free, preferably at least 75% free, and morepreferably at least 90% free, from other components with which they arenaturally associated. The present invention contemplates both purified(including substantially purified) and unpurified hybrid enzyme(s)(which are described in more detail below).

As used herein, the terms “complementary” or “complementarity” are usedin reference to nucleotide sequences related by the base-pairing rules.For example, the sequence 5′-AGT-3′ is complementary to the sequence5′-ACT-3′. Complementarity can be “partial” or “total.” “Partial”complementarity is where one or more nucleic acid bases is not matchedaccording to the base pairing rules. “Total” or “complete”complementarity between nucleic acids is where each and every nucleicacid base is matched with another base under the base pairing rules. Thedegree of complementarity between nucleic acid strands has significanteffects on the efficiency and strength of hybridization between nucleicacid strands.

A “complement” of a nucleic acid sequence as used herein refers to anucleotide sequence whose nucleic acids show total complementarity tothe nucleic acids of the nucleic acid sequence. For example, the presentinvention contemplates the complements of SEQ ID NOS: 1, 3, 5, 9, 27,28, 29, 30, 31, 32, 33, 34, 35, 37, 38, 40, 41 and 43.

The term “homology” when used in relation to nucleic acids refers to adegree of complementarity. There may be partial homology (i.e., partialidentity) or complete homology (i.e., complete identity). A partiallycomplementary sequence is one that at least partially inhibits acompletely complementary sequence from hybridizing to a target nucleicacid and is referred to using the functional term “substantiallyhomologous.” The inhibition of hybridization of the completelycomplementary sequence to the target sequence may be examined using ahybridization assay (Southern or Northern blot, solution hybridizationand the like) under conditions of low stringency. A substantiallyhomologous sequence or probe (i.e., an oligonucleotide which is capableof hybridizing to another oligonucleotide of interest) will compete forand inhibit the binding (i.e., the hybridization) of a completelyhomologous sequence to a target under conditions of low stringency. Thisis not to say that conditions of low stringency are such thatnon-specific binding is permitted; low stringency conditions requirethat the binding of two sequences to one another be a specific (i e.,selective) interaction. The absence of non-specific binding may betested by the use of a second target which lacks even a partial degreeof complementarity (e.g., less than about 30% identity): in the absenceof non-specific binding the probe will not hybridize to the secondnon-complementary target.

When used in reference to a double-stranded nucleic acid sequence suchas a cDNA or genomic clone, the term “substantially homologous” refersto any probe which can hybridize to either or both strands of thedouble-stranded nucleic acid sequence under conditions of low stringencyas described infra.

When used in reference to a single-stranded nucleic acid sequence, theterm “substantially homologous” refers to any probe which can hybridizeto the single-stranded nucleic acid sequence under conditions of lowstringency as described infra.

The term “hybridization” as used herein includes “any process by which astrand of nucleic acid joins with a complementary strand through basepairing.” [Coombs J (1994) Dictionary of Biotechnology, Stockton Press,New York N.Y.]. Hybridization and the strength of hybridization (i.e.,the strength of the association between the nucleic acids) is impactedby such factors as the degree of complementarity between the nucleicacids, stringency of the conditions involved, the T_(m) of the formedhybrid, and the G:C ratio within the nucleic acids.

As used herein, the term “T_(m)” is used in reference to the “meltingtemperature.” The melting temperature is the temperature at which apopulation of double-stranded nucleic acid molecules becomes halfdissociated into single strands. The equation for calculating the T_(m)of nucleic acids is well known in the art. As indicated by standardreferences, a simple estimate of the T_(m) value may be calculated bythe equation: T_(m)=81.5+0.41(% G+C), when a nucleic acid is in aqueoussolution at 1 M NaCl [see e.g., Anderson and Young, Quantitative FilterHybridization, in: Nucleic Acid Hybridization (1985)]. Other referencesinclude more sophisticated computations which take structural as well assequence characteristics into account for the calculation of T_(m).

Low stringency conditions when used in reference to nucleic acidhybridization comprise conditions equivalent to binding or hybridizationat 68° C. in a solution consisting of 5× SSPE (Saline, Sodium Phosphate,EDTA) (43.8 g/l NaCl, 6.9 g/l NaH₂PO₄.H₂O and 1.85 g/l EDTA(Ethylenediaminetetracetic Acid), pH adjusted to 7.4 with NaOH), 0.1%SDS (Sodium dodecyl sulfate), 5× Denhardt's reagent [50× Denhardt'scontains the following per 500 ml: 5 g Ficoll (Type 400, Pharmacia), 5 gBSA (Bovine Serum Albumin) (Fraction V; Sigma)] and 100 μg/l denaturedsalmon sperm DNA followed by washing in a solution comprising between0.2× and 2.0× SSPE, and 0.1% SDS at room temperature when a DNA probe ofabout 100 to about 1000 nucleotides in length is employed.

High stringency conditions when used in reference to nucleic acidhybridization comprise conditions equivalent to binding or hybridizationat 68° C. in a solution consisting of 5× SSPE, 1% SDS, 5× Denhardt'sreagent and 100 μg/ml denatured salmon sperm DNA followed by washing ina solution comprising 0.1× SSPE, and 0.1% SDS at 68° C. when a probe ofabout 100 to about 1000 nucleotides in length is employed.

The term “equivalent” when made in reference to a hybridizationcondition as it relates to a hybridization condition of interest meansthat the hybridization condition and the hybridization condition ofinterest result in hybridization of nucleic acid sequences which havethe same range of percent (%) homology. For example, if a hybridizationcondition of interest results in hybridization of a first nucleic acidsequence with other nucleic acid sequences that have from 50% to 70%homology to the first nucleic acid sequence, then another hybridizationcondition is said to be equivalent to the hybridization condition ofinterest if this other hybridization condition also results inhybridization of the first nucleic acid sequence with the other nucleicacid sequences that have from 50% to 70% homology to the first nucleicacid sequence.

When used in reference to nucleic acid hybridization the art knows wellthat numerous equivalent conditions may be employed to comprise eitherlow or high stringency conditions; factors such as the length and nature(DNA, RNA, base composition) of the probe and nature of the target (DNA,RNA, base composition, present in solution or immobilized) and theconcentration of the salts and other components (e.g., the presence orabsence of formamide, dextran sulfate, polyethylene glycol) areconsidered and the hybridization solution may be varied to generateconditions of either low or high stringency hybridization differentfrom, but equivalent to, the above-listed conditions.

The term “promoter,” “promoter element,” or “promoter sequence” as usedherein, refers to a DNA sequence which when ligated to a nucleotidesequence of interest is capable of controlling the transcription of thenucleotide sequence of interest into mRNA. A promoter is typically,though not necessarily, located 5′ (i.e., upstream) of a nucleotidesequence of interest whose transcription into mRNA it controls, andprovides a site for specific binding by RNA polymerase and othertranscription factors for initiation of transcription.

Promoters may be tissue specific or cell specific. The term “tissuespecific” as it applies to a promoter refers to a promoter that iscapable of directing selective expression of a nucleotide sequence ofinterest to a specific type of tissue (e.g., petals) in the relativeabsence of expression of the same nucleotide sequence of interest in adifferent type of tissue (e.g., roots). Tissue specificity of a promotermay be evaluated by, for example, operably linking a reporter gene tothe promoter sequence to generate a reporter construct, introducing thereporter construct into the genome of a plant such that the reporterconstruct is integrated into every tissue of the resulting transgenicplant, and detecting the expression of the reporter gene (e.g.,detecting mRNA, protein, or the activity of a protein encoded by thereporter gene) in different tissues of the transgenic plant. Thedetection of a greater level of expression of the reporter gene in oneor more tissues relative to the level of expression of the reporter genein other tissues shows that the promoter is specific for the tissues inwhich greater levels of expression are detected. The term “cell typespecific” as applied to a promoter refers to a promoter which is capableof directing selective expression of a nucleotide sequence of interestin a specific type of cell in the relative absence of expression of thesame nucleotide sequence of interest in a different type of cell withinthe same tissue. The term “cell type specific” when applied to apromoter also means a promoter capable of promoting selective expressionof a nucleotide sequence of interest in a region within a single tissue.Cell type specificity of a promoter may be assessed using methods wellknown in the art, e.g., immuno-histochemical staining. Briefly, tissuesections are embedded in paraffin, and paraffin sections are reactedwith a primary antibody which is specific for the polypeptide productencoded by the nucleotide sequence of interest whose expression iscontrolled by the promoter. A labeled (e.g., peroxidase conjugated)secondary antibody which is specific for the primary antibody is allowedto bind to the sectioned tissue and specific binding detected (e.g.,with avidin/biotin) by microscopy.

Promoters may be constitutive or regulatable. The term “constitutive”when made in reference to a promoter means that the promoter is capableof directing transcription of an operably linked nucleic acid sequencein the absence of a stimulus (e.g., heat shock, chemicals, light, orsimilar stimuli). Typically, constitutive promoters are capable ofdirecting expression of a transgene in substantially any cell and anytissue. In contrast, a “regulatable” promoter is one which is capable ofdirecting a level of transcription of an operably linked nuclei acidsequence in the presence of a stimulus (e.g. heat shock, chemicals,light, or similar stimuli) which is different from the level oftranscription of the operably linked nucleic acid sequence in theabsence of the stimulus.

The terms “infecting” and “infection” with a bacterium refer toco-incubation of a target biological sample, (e.g., cell, tissue, plantpart) with the bacterium under conditions such that nucleic acidsequences contained within the bacterium are introduced into one or morecells of the target biological sample.

The term “Agrobacterium” refers to a soil-borne, Gram-negative,rod-shaped phytopathogenic bacterium which causes crown gall; The term“Agrobacterium” includes, but is not limited to, the strainsAgrobacterium tumefaciens, (which typically causes crown gall ininfected plants), and Agrobacterium rhizogens (which causes hairy rootdisease in infected host plants). Infection of a plant cell withAgrobacterium generally results in the production of opines (e.g.,nopaline, agropine, octopine) by the infected cell. Thus, Agrobacteriumstrains which cause production of nopaline (e.g., strain LBA4301, C58,A208) are referred to as “nopaline-type” Agrobacteria; Agrobacteriumstrains which cause production of octopine (e.g., strain LBA4404, Ach5,B6) are referred to as “octopine-type” Agrobacteria; and Agrobacteriumstrains which cause production of agropine (e.g., strain EHA105, EHA101,A281) are referred to as “agropine-type” Agrobacteria.

The terms “bombarding, “bombardment,” and “biolistic bombardment” referto the process of accelerating particles towards a target biologicalsample (e.g., cell, tissue, plant part—such as a leaf, or intact plant)to effect wounding of the cell membrane of a cell in the targetbiological sample and/or entry of the particles into the targetbiological sample. Methods for biolistic bombardment are known in theart (e.g., U.S. Pat. Nos. 5,584,807 and 5,141,131, the contents of bothare herein incorporated by reference), and are commercially available(e.g., the helium gas-driven microprojectile accelerator (PDS-1000/He)(BioRad).

The term “microwounding” when made in reference to plant tissue refersto the introduction of microscopic wounds in that tissue. Microwoundingmay be achieved by, for example, particle bombardment as describedherein. The present invention specifically contemplates schemes forintroducing nucleic acid which employ microwounding.

The term “organism” as used herein refers to all organisms and inparticular organisms containing glycoproteins with n-linked glycans.

The term “plant” as used herein refers to a plurality of plant cellswhich are largely differentiated into a structure that is present at anystage of a plant's development. Such structures include, but are notlimited to, a fruit, shoot, stem, root, leaf, seed, flower petal, orsimilar structure. The term “plant tissue” includes differentiated andundifferentiated tissues of plants including, but not limited to, roots,shoots, leaves, pollen, seeds, tumor tissue and various types of cellsin culture (e.g., single cells, protoplasts, embryos, callus,protocorm-like bodies, and other types of cells). Plant tissue may be inplanta, in organ culture, tissue culture, or cell culture. Similarly,“plant cells” may be cells in culture or may be part of a plant.

Glycosyltransferases are enzymes that catalyze the processing reactionsthat determine the structures of cellular oligosaccharides, includingthe oligosaccharides on glycoproteins. As used herein,“glycosyltransferase” is meant to include mannosidases, even thoughthese enzymes trim glycans and do not “transfer” a monosaccharide.Glycosyltransferases share the feature of a type II membraneorientation. Each glycosyltransferase is comprised of an amino terminalcytoplasmic tail (shown for illustration purposes below as a made up ofa string of amino acids arbitrarily labeled “X”—without intending tosuggest the actual size of the region), a signal anchor domain (shownbelow as made up of a string of amino acids labeled “H” forhydrophobic—without intending to suggest the actual size of the domainand without intending to suggest that the domain is only made up ofhydrophobic amino acids) that spans the membrane (referred to herein asa “transmembrane domain”), followed by a luminal stem (shown below asmade up of a string of amino acids arbitrarily labeled “S”—withoutintended to suggest the actual size of the region) or stalk region, anda carboxy-terminal catalytic domain (shown below as made up of a stringof amino acids arbitrarily labeled “C”—without intending to suggest theactual size of the domain:NH₂—XXXXXXXHHHHHHHHSSSSSSSSCCCCCCCCCollectively, The Cytoplasmic Tail-Transmembrane-Stem Region or “CTS”(which has been underlined in the above schematic for clarity) can beused (or portions thereof) in embodiments contemplated by the presentinvention wherein the catalytic domain is exchanged or “swapped” with acorresponding catalytic domain from another molecule (or portions ofsuch regions/domains) to create a hybrid protein.

For example, in a preferred embodiment, the present inventioncontemplates nucleic acid encoding a hybrid enzyme (as well as vectorscontaining such nucleic acid, host cells containing such vectors, andthe hybrid enzyme itself), said hybrid enzyme comprising at least aportion of a CTS region [e.g., the cytoplasmic tail (“C”), thetransmembrane domain (“T”), the cytoplasmic tail together with thetransmembrane domain (“CT”), the transmembrane domain together with thestem (“TS”), or the complete CTS region] of a first glycosyltransferase(e.g. plant glycosyltransferase) and at least a portion of a catalyticregion of a second glycosyltransferase (e.g. mammalianglycosyltransferase). To create such an embodiment, the coding sequencefor the entire CTS region (or portion thereof) may be deleted fromnucleic acid coding for the mammalian glycosyltransferase and replacedwith the coding sequence for the entire CTS region (or portion thereof)of a plant glycosyltransferase. On the other hand, a different approachmight be taken to create this embodiment; for example, the codingsequence for the entire catalytic domain (or portion thereof) may bedeleted from the coding sequence for the plant glycosyltransferase andreplaced with the coding sequence for the entire catalytic domain (orportion thereof) of the mammalian glycosyltransferase. In such a case,the resulting hybrid enzyme would have the amino-terminal cytoplasmictail of the plant glycosyltransferase linked to the plantglycosyltransferase transmembrane domain linked to the stem region ofthe plant glycosyltransferase in the normal manner of the wild-typeplant enzyme—but the stem region would be linked to the catalytic domainof the mammalian glycosyltransferase (or portion thereof).

It is not intended that the present invention be limited only to the twoapproaches outlined above. Other variations in the approach arecontemplated. For example, to create nucleic acid encoding a hybridenzyme, said hybrid enzyme comprising at least a portion of atransmembrane region of a plant glycosyltransferase and at least aportion of a catalytic region of a mammalian glycosyltransferase, onemight use less than the entire coding sequence for the CTS region (e.g.,only the transmembrane domain of the plant glycosytransferase, or thecomplete cytoplasmic tail together with all or a portion of thetransmembrane domain, or the complete cytoplasmic tail together with allof the transmembrane domain together with a portion of the stem region).One might delete the mammalian coding sequence for the entirecytoplasmic tail together with the coding sequence for the transmembranedomain (or portion thereof)—followed by replacement with thecorresponding coding sequence for the cytoplasmic tail and transmembranedomain (or portion thereof) of the plant glycosyltransferase. In such acase, the resulting hybrid enzyme would have the stem region of themammalian glycosyltransferase linked to the plant glycosyltransferasetransmembrane domain (or portion thereof) which in turn would be linkedto the amino-terminal cytoplasmic tail of the plant glycosyltransferase,with the stem region being linked to the catalytic domain of themammalian glycosyltransferase (i.e. two of the four regions/domainswould be of plant origin and two would be of mammalian origin).

In other embodiments, the present invention contemplates nucleic acidencoding a hybrid enzyme (along with vectors, host cells containing thevectors, plants—or plant parts—containing the host cells), said hybridenzyme comprising at least a portion of an amino-terminal cytoplasmictail of a plant glycosyltransferase and at least a portion of acatalytic region of a mammalian glycosyltransferase. In this embodiment,the hybrid enzyme encoded by the nucleic acid might or might not containother plant sequences (e.g., the transmembrane domain or portionthereof, the stem region or portion thereof). For example, to createsuch an embodiment, the coding sequence for the entire cytoplasmic tail(or portion thereof) may be deleted from nucleic acid coding for themammalian glycosyltransferase and replaced with the coding sequence forthe entire cytoplasmic domain (or portion thereof) of a plantglycosyltransferase. In such a case, the resulting hybrid enzyme wouldhave the amino-terminal cytoplasmic tail (or portion thereof) of theplant glycosyltransferase linked to the mammalian glycosyltransferasetransmembrane domain, which in turn is linked to stem region of themammalian glycosyltransferase, the stem region being linked to thecatalytic domain of the mammalian glycosyltransferase. On the otherhand, a different approach might be taken to create this embodiment; forexample, the coding sequence for the entire catalytic domain (or portionthereof) may be deleted from the coding sequence for the plantglycosyltransferase and replaced with the coding sequence for the entirecatalytic domain (or portion thereof) of the mammalianglycosyltransferase. In such a case, the resulting hybrid enzyme wouldhave the amino-terminal cytoplasmic tail of the plantglycosyltransferase linked to the plant glycosyltransferasetransmembrane domain linked to the stem region of the plantglycosyltransferase in the normal manner of the wild-type plantenzyme—but the stem region would be linked to the catalytic domain ofthe mammalian glycosyltransferase (or portion thereof).

In the above discussion, the use of the phrase “or portion thereof” wasused to expressly indicate that less than the entire region/domain mightbe employed in the particular case (e.g., a fragment might be used). Forexample, the cytoplasmic tail of glycosyltransferases ranges fromapproximately 5 to 50 amino acids in length, and more typically 15 to 30amino acids, depending on the particular transferase. A “portion” of thecytoplasmic tail region is herein defined as no fewer than four aminoacids and can be as large as up to the full length of the region/domainless one amino acid. It is desired that the portion function in a manneranalogous to the full length region/domain—but need not function to thesame degree. For example, to the extent the full-length cytoplasmic tailfunctions as a Golgi retention region or ER retention signal, it isdesired that the portion employed in the above-named embodiments alsofunction as a Golgi or ER retention region, albeit perhaps not asefficiently as the full-length region.

Similarly, the transmembrane domain is typically 15-25 amino acids inlength and made up of primarily hydrophobic amino acids. A “portion” ofthe transmembrane domain is herein defined as no fewer than ten aminoacids and can be as large as up to the full length of the region/domain(for the particular type of transferase) less one amino acid. It isdesired that the portion function in a manner analogous to the fulllength region/domain—but need not function to the same degree. Forexample, to the extent the full-length transmembrane domain functions asthe primary Golgi retention region or ER retention signal, it is desiredthat the portion employed in the above-named embodiments also functionas a Golgi or ER retention region, albeit perhaps not as efficiently asthe full-length region. The present invention specifically contemplatesconservative substitutions to create variants of the wild-typetransmembrane domain or portions thereof. For example, the presentinvention contemplates replacing one or more hydrophobic amino acids(shown as “H” in the schematic above) of the wild-type sequence with oneor more different amino acids, preferably also hydrophobic amino acids.

A portion of the catalytic domain can be as large as the full length ofthe domain less on amino acid. Where the catalytic domain is from abeta1,4-galactosyltransferase, it is preferred that the portion includeat a minimum residues 345-365 which are believed to be involved in theconformation conferring an oligosaccharide acceptor binding site (it ispreferred that the portion include this region at a minimum and five toten amino acids on either side to permit the proper conformation).

The present invention also includes synthetic CTS regions and portionsthereof. A “portion” of a CTS region must include at least one (and mayinclude more than one) entire domain (e.g., the entire transmembranedomain) but less than the entire CTS region.

Importantly, by using the term “CTS region” or “transmembrane domain” itis not intended that only wild type sequences be encompassed. Indeed,this invention is not limited to natural glycosyltransferases andenzymes involved in glycosylation, but also includes the use ofsynthetic enzymes exhibit the same or similar function. In oneembodiment, wild type domains are changed (e.g. by deletion, insertion,replacement and the like).

Finally, by using the indicator “Tm” when referring to a particularhybrid (e.g., “TmXyl-), entire transmembrane/CTS domains (with orwithout changes to the wild-type sequence) as well as portions (with orwithout changes to the wild-type sequence) are intended to beencompassed.

SUMMARY OF THE INVENTION

The present invention contemplates nucleic acid (whether DNA or RNA)encoding hybrid enzymes (or “fusion proteins”), vectors containing suchnucleic acid, host cells (including but not limited to cells in planttissue and whole plants) containing such vectors an expressing thehybrid enzymes, and the isolated hybrid enzyme(s) themselves. In oneembodiment, expression of said hybrid enzymes (or “fusion proteins”)results in changes in glycosylation, such as, but not limited to,reduction of sugar moieties such as xylose, fucose, Lewis^(A/B/X) orother sugar structures that interfere with desired glycoformaccumulation. In one embodiment, the present invention contemplatesnucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising aCTS region (or portion thereof) of a glycosyltransferase (including butnot limited to a plant glycosyltransferase) and a catalytic region (orportion thereof) of a non-plant glycosyltransferase (e.g., mammalian,fish, amphibian, fungal). It is preferred that, when expressed, the CTSregion (or portion thereof) is linked (directly or indirectly) inoperable combination to said catalytic region (or portion thereof). Thelinking is preferably covalent and the combination is operable in thatthe catalytic region exhibits catalytic function (even if said catalyticfunction is reduced as compared to the wild-type enzyme). The linkingcan be direct in the sense that there are no intervening amino acids orother regions/domains. On the other hand, the linking can be indirect inthat there are intervening amino acids (or other chemical groups) and/orother regions/domains between them. Of course, the nucleic acid used tomake the nucleic acid encoding the above-described hybrid enzyme(s) canbe obtained enzymatically from a physical sequence (e.g. genomic DNA, acDNA, and the like) or alternatively, made synthetically using areference sequence (e.g. electronic or hardcopy sequence) as a guide.

In a particular embodiment, the present invention contemplates nucleicacid encoding a hybrid enzyme, said hybrid enzyme comprising atransmembrane region (e.g., at least a transmembrane region andoptionally more of the CTS region) of a plant glycosyltransferase and acatalytic region (or portion thereof) of a non-plant (such as amammalian) glycosyltransferase. Again, it is preferred that, whenexpressed, these regions are linked (directly or indirectly) in operablecombination. In yet another embodiment, the present inventioncontemplates nucleic acid encoding a hybrid enzyme, said hybrid enzymecomprising a transmembrane domain (or portion thereof) of a plantglycosyltransferase and a catalytic region (or portion thereof) of amammalian glycosyltransferase. Again, it is preferred that, whenexpressed, these regions are linked (directly or indirectly) in operablecombination.

It is not intended that the present invention be limited to particulartransferases. In one embodiment, the plant glycosyltransferase is axylosyltransferase. In another embodiment, the plant glycosyltransferaseis a N-acetylglucosaminyltransferase. In another embodiment, the plantglycosyltransferase is a fucosyltransferase. In a preferred embodiment,the mammalian glycosyltransferase is a human galactosyltransferase (suchas the human beta 1,4-galactosyltransferase encoded by SEQ ID NO:1wherein the nucleotides encoding the transmembrane domain are deletedand replaced).

It is not intended that the present invention is limited to the use of aplant-derived glycosyltransferase CTS-domain and a humanglycosyltransferase catalytic domain but also vice versa and the use ofany CTS-domain of a glycosyltransferase in combination with thecatalytic fragment of at least one other glycosyltransferase. Indeed,the present invention broadly contemplates, in one embodiment, nucleicacid encoding a hybrid enzyme, said hybrid enzyme comprising atransmembrane region of a first glycosyltransferase and a catalyticregion of a second glycosyltransferase. It is preferred that said firstand second glycosyltransferases are from different species (and can befrom a different genus or even from a different phylum). In oneembodiment, said first glycosyltransferase comprises a plantglycosyltransferase. In another embodiment, said plantglycosyltransferase is a xylosyltransferase. In yet another embodiment,said plant glycosyltransferase is a fucosyltransferase. In a preferredembodiment said second glycosyltransferase comprises a mammalianglycosyltransferase. In a particularly preferred embodiment, saidmammalian glycosyltransferase is a human galactosyltransferase.

It is not intended that the present invention be limited tocircumstances where the first and second glycosyltransferases are plantand non-plant, respectively. In one embodiment, said firstglycosyltransferase comprises a first mammalian glycosyltransferase andsaid second glycosyltransferase comprises a second mammalianglycosyltransferase. In a preferred embodiment, said first mammalianglycosyltransferase is a non-human glycosyltransferase and said secondmammalian glycosyltransferase is a human glycosyltransferase.

It is not intended that the present invention be limited to the type ofvector. In one embodiment, the present invention contemplates anexpression vector, comprising the nucleic acid encoding theabove-described hybrid enzyme.

It is also not intended that the present invention be limited to thetype of host cells. A variety of prokaryotic and eukaryotic host cellsare commercially available for expressing proteins. In one embodiment,the present invention contemplates a host cell containing the vectorcomprising the nucleic acid encoding the above-described hybrid enzyme(with or without other vectors or other nucleic acid encoding otherhybrid enzymes or glycosyltransferases). In a preferred embodiment, thehost cell is a plant cell. In a particularly preferred embodiment, thepresent invention contemplates a plant comprising such a host cell.

It is not intended that the present invention be limited by the methodby which host cells are made to express the hybrid enzymes of thepresent invention. In one embodiment, the present invention contemplatesa method, comprising: a) providing: i) a host cell (such as a plantcell, whether in culture or as part of plant tissue or even as part ofan intact growing plant), and ii) an expression vector comprisingnucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising atleast a portion of a CTS region of a plant glycosyltransferase (e.g. thetransmembrane domain) and at least a portion of a catalytic region of amammalian glycosyltransferase; and b) introducing said expression vectorinto said plant cell under conditions such that said hybrid enzyme isexpressed. Again, it is not intended that the present invention belimited to particular transferases. In one embodiment, the plantglycosyltransferase used in the above-described method is axylosyltransferase. In another embodiment, the plant glycosyltransferaseis a N-acetylglucosaminyltransferase. In another embodiment, the plantglycosyltransferase is a fucosyltransferase. In a preferred embodimentthe mammalian glycosyltransferase used in the above-described method isa human galactosyltransferase (such as the human beta1,4-galactosyltransferase encoded by SEQ ID NO:1 wherein the nucleotidesencoding the transmembrane domain are deleted and replaced) (or simplywhere the nucleotides of SEQ ID NO:1 encoding the catalytic domain, orportion thereof, are taken and linked to nucleotides encoding the CTSregion, or portion thereof, of a plant glycosyltransferase.).

It is not intended that the present invention be limited to a particularscheme for controlling glycosylation of a heterologous protein using thehybrid enzymes described above. In one embodiment, the present inventioncontemplates a method, comprising: a) providing: i) a host cell (such asa plant cell), ii) a first expression vector comprising nucleic acidencoding a hybrid enzyme, said hybrid enzyme comprising at least aportion of a CTS region (e.g. at least a transmembrane domain) of afirst (such as a plant) glycosyltransferase and at least a portion of acatalytic region of a second (such as a mammalian) glycosyltransferase,and iii) a second expression vector comprising nucleic acid encoding aheterologous glycoprotein; (or portion thereof; and b) introducing saidfirst and second expression vectors into said plant cell underconditions such that said hybrid enzyme and said heterologous proteinare expressed. Alternatively, a single vector with nucleic acid encodingboth the hybrid enzyme (or hybrid enzymes) and the heterologousglycoprotein might be used. Regardless of which method is used, theinvention contemplates, in one embodiment, the additional step (c) ofisolating the heterologous protein—as well as the isolated proteinitself as a composition.

On the other hand, the present invention also contemplates introducingdifferent vectors into different plant cells (whether they are cells inculture, part of plant tissue, or even part of an intact growing plant).In one embodiment, the present invention contemplates a method,comprising: a) providing: i) a first plant comprising a first expressionvector, said first vector comprising nucleic acid encoding a hybridenzyme (or encoding two or more hybrid enzymes), said hybrid enzymecomprising at least a portion of a CTS region (e.g. the firstapproximately 40-60 amino acids of the N-terminus) of a plantglycosyltransferase and at least a portion of a catalytic region of amammalian glycosyltransferase, and ii) a second plant comprising asecond expression vector, said second vector comprising nucleic acidencoding a heterologous protein (or portion thereof); and crossing saidfirst plant and said second plant to produce progeny expressing saidhybrid enzyme and said heterologous protein. Of course, such progeny canbe isolated, grown up, and analyzed for the presence of each (or both)of the proteins. Indeed, the heterologous protein can be used (typicallyfirst purified substantially free of plant cellular material)therapeutically (e.g., administered to a human or animal, whetherorally, by intravenous, transdermally or by some other route ofadministration) to treat or prevent disease.

It is not intended that the present invention be limited to a particularheterologous protein. In one embodiment, any peptide or protein that isnot endogenous to the host cell (or organism) is contemplated. In oneembodiment, the heterologous protein is an antibody or antibodyfragment. In a particularly preferred embodiment, the antibody is ahuman antibody or “humanized” antibody expressed in a plant in highyield. “Humanized” antibodies are typically prepared from non-humanantibodies (e.g. rodent antibodies) by taking the hypervariable regions(the so-called CDRs) of the non-human antibodies and “grafting” them onto human frameworks. The entire process can be synthetic (provided thatthe sequences are known) and frameworks can be selected from a databaseof common human frameworks. Many times, there is a loss of affinity inthe process unless either the framework sequences are modified or theCDRs are modified. Indeed, increases in affinity can be revealed whenthe CDRs are systematically mutated (for example, by randomizationprocedures) and tested.

While the present invention is particularly useful in the context ofheterologous proteins, in one embodiment, the hybrid enzymes of thepresent invention are used to change the glycosylation of endogenousproteins, i.e. proteins normally expressed by the host cell or organism.

The present invention specifically contemplates the plants themselves.In one embodiment, the present invention contemplates a plant,comprising first and second expression vectors, said first vectorcomprising nucleic acid encoding a hybrid enzyme, said hybrid enzymecomprising at least a portion of a CTS region (e.g. the cytoplasmic tailtogether with at least a portion of the transmembrane domain) of a plantglycosyltransferase and at least a portion of a catalytic region of amammalian glycosyltransferase, said second expression vector, saidsecond vector comprising nucleic acid encoding a heterologous protein(or portion thereof). In a preferred embodiment, by virtue of beingexpressed along with the hybrid enzyme (or hybrid enzymes) of thepresent invention, the heterologous protein displays reduced (10% to99%) alpha 1,3-fucosylation (or even no fucosylation), as compared towhen the heterologous protein is expressed in the plant in the absenceof the hybrid enzyme (or enzymes). In a preferred embodiment, by virtueof being expressed along with the hybrid enzyme (or hybrid enzymes) ofthe present invention, the heterologous protein displays reduced (10% to99%) xylosylation (or even no xylose), as compared to when theheterologous protein is expressed in the plant in the absence of thehybrid enzyme (or enzymes). In a preferred embodiment, by virtue ofbeing expressed along with the hybrid enzyme (or hybrid enzymes) of thepresent invention, the heterologous protein displays both reduced fucoseand xylose, as compared to when the heterologous protein is expressed inthe plant in the absence of the hybrid enzyme (or enzymes).

It is not intended that the present invention be limited to a particulartheory by which reduced fucose and/or xylose is achieved. Very little isknown about the sub-Golgi sorting mechanism in plants. The mammalianspecific β(1,4)-galactosyltransferase (GalT) has been used (see theExamples below) as an excellent first marker to study this phenomenonsince it generates glycan structures not normally found in plants. Theglycan structures of plants that express galactosyltransferase has beencompared with glycan structures from plants that express a chimericgalactosyltransferase of which the CTS domain is exchanged for that of aplant xylosyltransferase (or portion thereof). The change in observedglycan structures show that the galactosyltransferase is, as in mammals,confined to a specific sub-compartment of the plant Golgi. Withoutlimiting the invention to any particular mechanism, the sortingmechanism of plants and mammals are apparently conserved even to theextent that glycosyltransferases unknown to plants are routed tospecific analogous location in the Golgi. This location is later in theGolgi than where the endogenous xylosyl-, fucosyl- and GlcNAcTII (GnTII)transferases are located.

The finding that N-glycans in these plants that express relocalisedvariants of GalT containing significantly less xylose and fucose is alsoof biotechnological relevance. For glycoproteins intended fortherapeutic use in mammals, such as humans, the approach of certainembodiments of the present invention provides methods and compositionsfor controlling N-linked glycosylation of glycoproteins in plants sothat glycoprotein essentially free of xylose and fucose and containingat least a bi-antennary N-glycans (but not limited to bi-antennary, alsoinclude tri-antennary, and the like) and (at least one) galactoseresidue on at least one of the arms of the N-glycan can be obtained.Hence, it is not intended that the present invention is limited tobi-antennary N-glycans but also includes bisected bi-antennaryN-glycans, tri-antennary N-glycans, and the like. Furthermore, theinvention is not limited to complex-type N-glycans but also includeshybrid-type N-glycans and other type N-glycans. The present inventioncontemplates such resulting glyco-proteins. In addition, the methods andcompositions of the present invention may be applicable for plants andnon-plant systems where besides xylose, fucose, Lewis^(A/B/X) typeN-glycan modifications (β1-3-GalT, α1-4-FucT, other) or other sugars,“interfere” with desired glycoform accumulation.

In one embodiment, the invention is directed to controlling N-linkedglycosylation of plants by modulating the localization of enzymesinvolved in glycan biosynthesis in the Golgi apparatus. Specifically,embodiments of the invention are directed to a method of producing in aplant host system a glycoprotein having bi-antennary glycans andcontaining at least one galactose residues on at least one of the armsand which are devoid (or reduced in) of xylose and fucose,comprising:(a) preventing (or inhibiting) addition of xylose and fucoseon the core of the glycan of said glycoprotein and (b) adding one orpreferably two galactose residues to said arms.

Addition of xylose and fucose to said heterologous glycoprotein may bereduced or even prevented by introducing to said plant host system anucleic acid encoding a hybrid enzyme comprising a CTS region (orportion thereof) of a protein, particularly an enzyme such as plantxylosyltransferase and catalytic region (or portion thereof) of agalactosyltransferase not normally found in a plant, or a modifiedgalactosyltransferase where its transmembrane portion has been removedand endoplasmic reticulum retention signal have been inserted, whereinsaid protein or enzyme acts earlier in the Golgi apparatus of a plantcell in said plant host system than said galactosyltransferase. It ispreferred that the galactosyltransferase is a mammaliangalactosyltransferase and in particular, a human galactosyltransferase.In a most specific embodiment, said galactosyltransferase is human β1,4galactosyltransferase (GalT). In a preferred embodiment, saidxylosyltransferase is a β1,2-xylosyltransferase. The exchange of the CTSregion or CTS fragment of a mammalian glycosyltransferase (such as agalactosyltransferase) by one from the group of enzymes that act earlierin the Golgi apparatus than galactosyltransferase including but notlimited to those from of XylT, FucT, GnTI, GnTII, GnTIII, GnTIV, GnTV,GnTVI, ManI, ManII and ManIII results in strongly reduced amounts ofglycans that contain the undesired xylose and fucose residues (see FIG.2). In addition, galactosylation is improved and the diversity inglycans is reduced. While not limited to any particular mechanism, theincrease in galactosylated glycans that carry neither xylose nor fucoseis believed to be mainly attributed to the accumulation of GalGNMan5,GNMan5 or GalGNMan4. Also, galactosylation occurs on one glycan armonly. Apparently, the galactosylation earlier in the Golgi inhibitstrimming of the said glycoforms by Mannosidase II (ManII) to GalGNMan3.Also addition of the second GlcNAc by GlcNAcTII (GnTII) is inhibited.

Therefore, in one embodiment, a further step is contemplated to obtainthe desired glycoprotein that has both arms galactosylated and yet isessentially devoid of xylose and fucose. Thus, in one embodiment, themethod of the invention as noted above further comprises addinggalactose residues to the arms of said glycoprotein (see FIG. 3). In oneembodiment of the invention, galactose residues are added onto both armsby introducing to said plant host system (a) a nucleic acid sequenceencoding a first hybrid enzyme comprising the CTS region (or fragment,such as one including the transmembrane domain) of GnTI and the activedomain (or portion thereof) of GnTII; (b) a nucleic acid sequenceencoding the second hybrid enzyme comprising the CTS region (orfragment, such as one including the transmembrane of GnTI and the activedomain of ManII and (c) a nucleic acid sequence encoding a third hybridenzyme comprising the CTS region (or fragment, such as one including thetransmembrane domain) of XylT and the active domain (or portion thereof)of human galactosyltransferse (TmXyl-GalT). In another embodiment of theinvention, galactose residues are added onto both arms by introducing tosaid plant host system (a) a nucleic acid sequence encoding a firsthybrid enzyme comprising the CTS region (or fragment, such as oneincluding the transmembrane domain) of ManI and the active domain (orportion thereof) of GnTI; (b) a nucleic acid sequence encoding thesecond hybrid enzyme comprising the CTS region (or fragment, such as oneincluding the transmembrane domain) of ManI and the active domain (orportion thereof) of GnTII; (c) a nucleic acid sequence encoding thethird hybrid enzyme comprising the CTS region (or fragment, such as oneincluding the transmembrane domain) of ManI and the active domain (orportion thereof) of ManII, and (d) a nucleic acid sequence encoding afourth hybrid enzyme comprising the CTS region (or fragment, such as oneincluding the transmembrane domain) of XylT and the active domain (orportion thereof) of human galactosyltransferse (TmXyl-GalT).

It is not intended that the present invention be limited to particularcombinations of hybrid enzymes or the number of such hybrid enzymesemployed in a single cell, plant tissue or plant. In a preferredembodiment, the present invention contemplates host cells expressingTmXyl-GalT plus TmGnTI-GnTII plus TmGnTI-ManII. In one embodiment of theinvention, galactose residues are added to said arms by introducing tosaid plant host system (a) a nucleic acid sequence encoding a firsthybrid enzyme comprising a CTS region (or fragment thereof) of aprotein, particularly an enzyme, including but not limited toN-acetylglucosaminyltransferase I (GnTI) and a catalytic region (orportion thereof) of a mannosidase II (ManII), wherein said enzyme actsearlier in the Golgi apparatus of a plant cell in said plant host systemthan said mannosidase II or modified mannosidase II where itstransmembrane portion has been deleted and endoplasmic reticulumretention signal have been inserted and (b) a nucleic acid sequenceencoding a second hybrid enzyme comprising a CTS region (or fragment,such as one including the transmembrane domain) of an enzyme includingbut not limited to N-acetyl-glucosaminyltransferase I (GnTI) and acatalytic region (or portion thereof) of aN-acetylglucosaminyl-transferase II (GnTII), wherein said enzyme actsearlier in the Golgi apparatus of a plant cell in said plant host systemthan said N acetylglucosaminyl-transferaseII (GnTII) or modifiedN-acetylglucosaminyltransferase II (GnTII) where its transmembraneportion has been deleted and an endoplasmic reticulum retention signalhave been inserted. The sequences encodingN-acetylglucosaminyltransferases or mannosidase II or the saidtransmembrane fragments can originate form plants or from eukaryoticnon-plant organisms (e.g., mammals).

In yet another preferred embodiment, the present invention contemplatesa host cell expressing TmXyl-GalT plus TmManI-GnTI plus TmManI-ManIIplus TmManI-GnTII. In another embodiment of the invention, galactoseresidues are added to said arms by introducing to said plant host system(a) a nucleic acid sequence encoding a first hybrid enzyme comprising aCTS region (or fragment, such as one including the transmembrane domain)of a protein, particularly an enzyme, including but not limited toMannosidase I (ManI) and a catalytic region (or portion thereof) of a Nacetylglucosaminyltransferase I (GnTI), wherein said enzyme acts earlierin the Golgi apparatus of a plant cell in said plant host system thansaid N-acetylglucosaminyl-transferase I (GnTI) or modified Nacetylglucosaminyltransferase I (GnTI) where its transmembrane portionhas been deleted and endoplasmic reticulum retention signal have beeninserted and (b) a nucleic acid sequence encoding a second hybrid enzymecomprising a CTS region (or fragment, such as one including thetransmembrane domain) of an enzyme including but not limited toMannosidase I (ManI) and a catalytic region (or portion thereof) of aMannosidase II (ManII), wherein said enzyme acts earlier in the Golgiapparatus of a plant cell in said plant host system than saidMannosidase II (ManII) or modified Mannosidase II (ManII) where itstransmembrane portion has been deleted and an endoplasmic reticulumretention signal have been inserted and (c) a nucleic acid sequenceencoding a third hybrid enzyme comprising a CTS region (or fragment,such as one including the transmembrane domain) of an enzyme includingbut not limited to Mannosidase I (ManI) and a catalytic region (orportion thereof) of a N-acetylglucos-aminyltransferase II (GnTII),wherein said enzyme acts earlier in the Golgi apparatus of a plant cellin said plant host system than said N-acetylglucosaminyltransferase II(GnTII) or modified N-acetylglucosaminyltransferase II (GnTII) where itstransmembrane portion has been deleted and an endoplasmic reticulumretention signal have been inserted. The sequences encodingN-acetylglucosaminyltransferases or mannosidases or the saidtransmembrane fragments can originate from plants or from eukaryoticnon-plant organisms (e.g., mammals).

In still another preferred embodiment, the present inventioncontemplates host cells expressing TmXyl-GalT plus ManII. In anotherembodiment of the invention, galactose residues are added to said armsby introducing to said plant host system (a) a nucleic acid sequenceencoding a Mannosidase III (ManIII, wildtype gene sequence but notlimited to: also ManII with endoplasmic reticulum retention signal;ManIII with transmembrane fragment of early (cis-) Golgi apparatusglycosyltransferase (GnTI, ManI, GnTIII). The sequences encodingMannosidase III can originate form insects, preferably from Spodopterafrugiperda or Drosophila melanogaster (but not limited to), human orfrom other organisms.

In still another preferred embodiment, the present inventioncontemplates a host cell expressing TmXyl-GalT plus ManIII plusTmGnTI-GnTII. In yet another preferred embodiment, the present inventioncontemplates a host cell expressing TmXyl-GalT plus ManIII plusTmManI-GnTI plus TmManI-GnTII.

The method of the invention may optionally comprise, in one embodiment,introducing into said plant host system a mammalianN-acetylglucosaminyltransferase GnTIII, particularly a human GnTIII orhybrid protein comprising a catalytic portion of mammalian GnTIII and atransmembrane portion of a protein, said protein residing in the ER orearlier compartment of the Golgi apparatus of a eukaryotic cell. Forexample, in one embodiment, the hybrid enzyme TmXyl-GnTIII iscontemplated (along with nucleic acid coding for such a hybrid enzyme,vectors containing such nucleic acid, host cells containing suchvectors, and plants—or plant parts—containing such host cells). Inanother embodiment, the hybrid enzyme TmFuc-GnTIII is contemplated(along with nucleic acid coding for such a hybrid enzyme, vectorscontaining such nucleic acid, host cells containing such vectors, andplants—or plant parts—containing such host cells). The present inventionspecifically contemplates host cells expressing such hybrid enzymes(with or without additional hybrid enzymes or otherglycosyltransferases).

The invention is further directed to said hybrid and modified enzymes,nucleic acid sequences encoding said hybrid enzymes, vectors comprisingsaid nucleic acid sequences and methods for obtaining said hybridenzymes. Furthermore, the invention is directed to a plant host systemcomprising a heterologous glycoprotein having preferably complex typebi-antennary glycans and containing at least one galactose residue on atleast one of the arms and are devoid of xylose and fucose. A“heterologous glycoprotein” is a glycoprotein originating from a speciesother than the plant host system. The glycoprotein may include but isnot limited to antibodies, hormones, growth factors and growth factorreceptors and antigens.

Indeed, the present invention is particularly useful for controlling theglycosylation of heterologous glycoproteins, such as antibodies orantibody fragments (single chain antibodies, Fab fragments, Fab₂fragments, Fv fragments, and the like). To control the glycosylation ofan antibody, the gene construct encoding a hybrid enzyme of the presentinvention (e.g., the TmXyl-GalT gene construct) can be introduced intransgenic plants expressing an antibody (e.g., monoclonal antibody) orantibody fragment. On the other hand, the gene(s) encoding the antibody(or antibody fragment) can be introduced by retransformation of plantexpressing TmXyl-GalT gene construct. In still another embodiment, thebinary vector harbouring the TmXyl-GalT expression cassette can beco-transformed to plants together with a plant binary vector harbouringthe expression cassettes comprising both light and heavy chain sequencesof a monoclonal antibody on a single T-DNA or with binary vectorsharbouring the expression cassettes for light and heavy chain sequencesboth separately on independent T-DNA's but both encoding a monoclonalantibody. The present invention specifically contemplates, in oneembodiment, crossing plants expressing antibodies with plant expressingthe hybrid glycosyltransferase(s) of the present invention.

A “host system” may include but is not limited to any organismcontaining glycoproteins with N-glycans.

A “plant host system” may include but is not limited to a plant orportion thereof, which includes but is not limited to a plant cell,plant organ and/or plant tissue. The plant may be a monocotyledon(monocot) which is a flowering plant whose embryos have one cotyledon orseed leaf and includes but is not limited to lilies, grasses, corn (Zeamays), rice, grains including oats, wheat and barley, orchids, irises,onions and palms. Alternatively, the plant may be a dicotyledenon(dicot) which includes but is not limited to tobacco (Nicotiana),tomatoes, potatoes, legumes (e.g, alfalfa and soybeans), roses, daises,cacti, violets and duckweed. The plant may also be a moss which includesbut is not limited to Physcomitrella patens.

The invention is further directed to a method for obtaining said planthost system. The method comprises crossing a plant expressing aheterologous glycoprotein with a plant comprising (a) a hybrid enzymecomprising a catalytic region (or portion thereof) of agalactosyltransferase not normally found in a plant and a CTS region (orfragment, such as one including the transmembrane domain) of a protein,wherein said protein acts earlier in the Golgi apparatus of a plant cellin said plant host system than said galactosyltransferase or a modifiedgalactosyltransferase where its transmembrane portion has been deletedand endoplasmic reticulum retention signal has been inserted; (b) ahybrid enzyme comprising a CTS region (or portion thereof, such as oneincluding the transmembrane domain) of a protein, particularly anenzyme, including but not limited to N-acetylglucosaminyltransferase I(GnTI) and a catalytic region (or portion thereof) of a mannosidase II(ManII), wherein said enzyme acts earlier in the Golgi apparatus of aplant cell in said plant host system than said mannosidase II ormodified mannosidase II where its transmembrane portion has been deletedand endoplasmic reticulum retention signal have been inserted and (c) ahybrid enzyme comprising at least a transmembrane region of an enzyme(such as the first 40-60 amino acids of the N-terminus) of aglycosyltransferase including but not limited toN-acetylglucosaminyltransferase I (GnTI) and a catalytic region of aN-acetylglucos-aminyltransferase II (GnTII), wherein said enzyme actsearlier in the Golgi apparatus of a plant cell in said plant host systemthan said N acetylglucosaminyltransferase II (GnTII) or modifiedN-acetylglucosaminyl-transferase II (GnTII) where its transmembraneportion has been deleted and an endoplasmic reticulum retention signalhave been inserted., harvesting progeny from said crossing and selectinga desired progeny plant expressing said heterologous glycoprotein.

The invention is further directed to said plant or portion thereof whichwould constitute a plant host system. Said plant host system may furthercomprise a mammalian GnTIII enzyme or hybrid protein comprising acatalytic portion of mammalian GnTII and a transmembrane portion of aprotein, said protein residing in the ER or earlier compartment of theGolgi apparatus of a eukaryotic cell.

Additionally, the invention also provides the use of a plant host systemto produce a desired glycoprotein or functional fragment thereof. Theinvention additionally provides a method for obtaining a desiredglycoprotein or functional fragment thereof comprising cultivating aplant according to the invention until said plant has reached aharvestable stage, for example when sufficient biomass has grown toallow profitable harvesting, followed by harvesting said plant withestablished techniques known in the art and fractionating said plantwith established techniques known in the art to obtain fractionatedplant material and at least partly isolating said glycoprotein from saidfractionated plant material.

Alternatively, said plant host cell system comprising said heterologousglycoprotein may also be obtained by introducing into a plant host cellsystem or portion thereof (a) a nucleic acid sequence encoding a hybridenzyme comprising a catalytic region of a galactosyltransferase notnormally found in a plant and at least the transmembrane region (or moreof the CTS) of a protein, wherein said protein acts earlier in the Golgiapparatus of a plant cell in said plant host system than saidgalactosyltransferase or a modified galactosyltransferase where itstransmembrane portion has been deleted and endoplasmic reticulumretention signal have been inserted; (b) a nucleic acid sequenceencoding a first hybrid enzyme comprising at least the transmembraneregion (or more of the CTS if desired) of a protein, particularly anenzyme, including but not limited to N-acetylglucosaminyltransferase I(GnTI) and a catalytic region of a mannosidase II (ManII), wherein saidenzyme acts earlier in the Golgi apparatus of a plant cell in said planthost system than said mannosidase II, or modified mannosidase II whereits transmembrane portion has been deleted and endoplasmic reticulumretention signal have been inserted and (c) a nucleic acid sequenceencoding a second hybrid enzyme comprising at least a transmembraneregion (more of the CTS if desired) of an enzyme including but notlimited to N-acetylglucosaminyl-transferase I (GnTI) and a catalyticregion of a N-acetylglucosaminyltransferase II (GnTII), wherein saidenzyme acts earlier in the Golgi apparatus of a plant cell in said planthost system than said N- acetylglucos-aminyltransferase-II (GnTII) ormodified N-acetylglucosaminyltransferase II (GnTII) where itstransmembrane portion has been deleted and an endoplasmic reticulumretention signal have been inserted. and isolating a plant or portionthereof expressing said heterologous glycoprotein (or portion thereof).In one embodiment, one vector comprising all of the nucleic acidsequences is introduced into said plant host system. In anotherembodiment, each nucleic acid sequence is inserted into separate vectorsand these vectors are introduced into said plant host system. In anotherembodiment combinations of two or more nucleic acid sequences areinserted into separate vectors which are than combined into said planthost system by retransformation or co-transformation or by crossing.

The invention also provides use of such a plant-derived glycoprotein orfunctional fragment thereof according to the invention for theproduction of a composition, particularly, pharmaceutical composition,for example for the treatment of a patient with an antibody, a hormone,a vaccine antigen, an enzyme, or the like. Such a pharmaceuticalcomposition comprising a glycoprotein or functional fragment thereof isnow also provided.

Finally, it is contemplated that the above-described approach may beuseful in reducing the overall diversity in glycans in plants expressingone or more of the hybrid enzymes of the present invention (as comparedto wild-type plants or plants simply transformed with only mammalianGalT).

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 compares the glycosylation pathway of glycoproteins in plants andin mammals.

FIG. 2 shows the effect of exchanging the CTS fragment ofgalactosyltransferase with xylosyltransferase

FIG. 3 shows the further effect of relocalizing mannosidase II andGlcNAcTII.

FIG. 4 top panel shows a T-DNA construct carrying the genes encodingglycan modifying enzymes to produce efficiently galactosylated glycansthat are devoid of immunogenic xylose and fucose and the bottom panelshows a T-DNA construct carrying antibody light chain and heavy chaingenes.

FIG. 5 shows the nucleic acid sequence (SEQ ID NO:1) for a humangalactosyltransferase (human B1,4-galactosyltransferase—GalT).

FIG. 6 shows the nucleic acid sequence of FIG. 5 along with thecorresponding amino acid sequence (SEQ ID NO:2).

FIG. 7 shows an illustrative mutated sequence (SEQ ID NO:59) derived thewild type amino acid sequence (SEQ ID NO:2) for a humangalactosyltransferase, wherein a serine has been deleted from thecytoplasmic tail and a G-I-Y motif has been repeated. Of course, suchchanges are merely illustrative of the many possible changes within thescope of the present invention. For example, in one embodiment, thepresent invention contemplates mutated sequences wherein only deletions(one or more) are employed (e.g. deletions in the cytoplasmic taildomain or the stem domain)—with no insertions or repeats. Similarly, inone embodiment, the present invention contemplates mutated sequenceswherein only (one or more) insertions or replacements (e.g. in thetransmembrane domain) are employed—with no deletions.

FIG. 8 shows the nucleic acid sequence (SEQ ID NO:3) encoding a hybridenzyme comprising human galactosyltransferase (humanB1,4-galactosyltransferase—GalT). The upper case letters are nucleotidesof Arabidopsis thaliana mRNA for beta 1,2-xylosyltransferase (databaseentry: EMBL:ATH277603, the TmXyl-fragment used involves nucleotides135-297 of this database sequence).

FIG. 9 shows the nucleic acid sequence of FIG. 8 along with thecorresponding amino acid sequence (SEQ ID NO:4).

FIG. 10 shows the amino acid sequence (SEQ ID NO:4) for the hybridenzyme encoded by the nucleic acid shown in FIG. 8.

FIG. 11 shows the nucleic acid sequence (SEQ ID NO:5) for the humanglycosyltransferase GnTIII (along with additional sequence encoding amyc-tag) (primary accession number Q09327 GNT3 HUMAN).

FIG. 12 shows the nucleic acid sequence of FIG. 11 along with thecorresponding amino acid sequence (SEQ ID NO:6).

FIG. 13 shows the amino acid sequence (SEQ ID NO:6) for a human GnTIII(along with additional amino acid sequence of the myc epitope tag SEQ IDNO:7).

FIG. 14 shows the nucleic acid sequence (SEQ ID NO:9) encoding oneembodiment of a hybrid enzyme of the present invention, said hybridenzyme comprising the transmembrane domain of a plant xylosyltransferase(TmXyl-) and the catalytic domain (along with other regions) for humanGnTIII (TmXyl-GnTIII) (along with additional sequence encoding amyc-tag).

FIG. 15 shows the nucleic acid sequence of FIG. 14 along with thecorresponding amino acid sequence (SEQ ID NO:10).

FIG. 16 shows the amino acid sequence (SEQ ID NO:10) for hybrid enzymeencoded by the nucleic acid of FIG. 14 (along with additional sequencefor the myc epitope tag SEQ ID NO:7).

FIG. 17 shows the complete nucleic acid sequence (SEQ ID NO:27) for acassette encoding the hybrid enzymes TmXyl-GalT plus TmGnTI-GnTII plusTmGnTI-ManII).

FIG. 18 shows the complete nucleic acid sequence (SEQ ID NO:28) for acassette encoding the hybrid enzyme TmGnTI-ManII (with the RbcS1promoter sequence SEQ ID NO:39 shown).

FIG. 19 shows the nucleic acid sequence (SEQ ID NO:29) encoding thehybrid enzyme TmGnTI-ManII.

FIG. 20 shows the nucleic acid sequence (SEQ ID NO:30) encoding thehybrid enzyme TmGnTI-GnTII.

FIG. 21 shows the nucleic acid sequence (SEQ ID NO:31) encoding thehybrid enzyme TmGnTI-GnTII, wherein the transmembrane fragment used(designated TmGntI) has the nucleic acid sequence set forth in SEQ IDNO:32.

FIG. 22A shows the nucleic acid sequence (SEQ ID NO:32) encoding oneembodiment of a transmembrane domain fragment (TmGnTI). FIG. 22B showsthe nucleic acid sequence (SEQ ID NO:33) encoding another embodiment ofa transmembrane domain fragment (TmManI).

FIG. 23 shows the complete nucleic acid sequence (SEQ ID NO:34) for atriple cassette embodiment of the present invention.

FIG. 24 shows the nucleic acid sequence (SEQ ID NO:35) for a hybrid geneexpression cassette (TmManI-GnTI).

FIG. 25 shows the nucleic acid sequence (SEQ ID NO:36) for the histone3.1 promoter.

FIG. 26 shows the nucleic acid sequence (SEQ ID NO:37) for the hybridgene fusion (TmManI-TmGnTI).

FIG. 27 shows the nucleic acid sequence (SEQ ID NO:38) for the hybridgene fusion TmManI-ManII (with the RbcS1 promoter sequence SEQ ID NO:39shown).

FIG. 28 shows the nucleic acid sequence (SEQ ID NO:39) for the RbcS1promoter.

FIG. 29 shows the nucleic acid sequence (SEQ ID NO:40) for the hybridgene TmManI-ManII wherein the nucleic acid sequence (SEQ ID NO:33)encoding the transmembrane fragment is shown.

FIG. 30 shows the nucleic acid sequence (SEQ ID NO:41) for the hybridgene TmManI-GnTII.

FIG. 31 shows the nucleic acid sequence (SEQ ID NO:42) for the Lhcapromoter.

FIG. 32 shows the nucleic acid sequence (SEQ ID NO:43) for the hybridgene TmManI-GnTII wherein the nucleic acid sequence (SEQ ID NO:33)encoding the transmembrane fragment is shown

FIG. 33 shows the nucleic acid sequence (SEQ ID NO:44) for theterminator sequence used (see below).

FIG. 34 is a Western Blot which examines total protein glycosylation ofplants of the present invention compared to control plants.

FIG. 35 is a lectin blot with RCA on F1 progeny of crossed plants, saidprogeny made according to one embodiment of the present invention

FIG. 36 is a Western Blot. Panel A was assayed with anti-IgG antibody.Panel B was assayed with an anti-HRP antibody. Panel C was assayed witha specific anti-Xyl antibody fraction. Panel D was assayed with aspecific anti-Fucose antibody fraction. Panel E was assayed with thelectin RCA.

FIG. 37 shows the nucleic acid sequence (SEQ ID NO:49) of a hybrid genewherein the aminoterminal CTS region of an insect Mannosidase III geneis replaced by a mouse signal peptide and a carboxyterminal endoplasmicreticulum retention signal (KDEL) was added.

FIG. 38 shows the corresponding amino acid sequence (SEQ ID NO:50) forthe nucleic acid sequence of FIG. 37.

FIG. 39 shows the nucleic acid sequence (SEQ ID NO:5 1) of a hybrid genewherein the aminoterminal CTS region of a humanbeta-1,4-galactosyltransferase (GalT) gene is replaced by a mouse signalpeptide and a carboxyterminal endoplasmic reticulum retention signal(KDEL) was added.

FIG. 40 shows the corresponding amino acid sequence (SEQ ID NO:52) forthe nucleic acid sequence of FIG. 39.

FIG. 41 shows the nucleic acid sequence (SEQ ID NO:53) of a hybrid genewherein the aminoterminal CTS region of an Arabidopsis thaliana GnTIgene is replaced by a mouse signal peptide and a carboxyterminalendoplasmic reticulum retention signal (KDEL) was added.

FIG. 42 shows the corresponding amino acid sequence (SEQ ID NO:54) forthe nucleic acid sequence of FIG. 41.

FIG. 43 shows the nucleic acid sequence (SEQ ID NO:55) of a hybrid genewherein the aminoterminal CTS region of an Arabidopsis thaliana GnTIIgene is replaced by a mouse signal peptide and a carboxyterminalendoplasmic reticulum retention signal (KDEL) was added.

FIG. 44 shows the corresponding amino acid sequence (SEQ ID NO:56) forthe nucleic acid sequence of FIG. 43.

FIG. 45 shows the nucleic acid sequence (SEQ ID NO:57) of a hybrid genewherein the aminoterminal CTS region of a humanbeta-1,4-galactosyltransferase (GalT) gene is replaced by the CTS regionof the human gene for GnTI.

FIG. 46 shows the corresponding amino acid sequence (SEQ ID NO:58) forthe nucleic acid sequence of FIG. 45.

FIG. 47 is a schematic of how enzymes might be localized to the Golgi.

FIG. 48 is a non-limiting speculative schematic of how the “swapping” ofregions of transferases might cause relocalization.

DETAILED DESCRIPTION OF THE INVENTION

Hybrid Enzymes

The nucleic acid sequences encoding the various glycosylation enzymessuch as mannosidases, GlcNAcTs, galactosyltransferases may be obtainedusing various recombinant DNA procedures known in the art, such aspolymerase chain reaction (PCR) or screening of expression libraries todetect cloned DNA fragments with shared structural features. See, e.g.,Innis et al., 1990, PCR: A Guide to Methods and Application, AcademicPress, New York. Other nucleic acid amplification procedures such asligase chain reaction (LCR), ligated activated transcription (LAT) andnucleic acid sequence-based amplification (NASBA) or long range PCR maybe used.

Once the DNA fragments are generated, identification of the specific DNAfragment containing the desired gene may be accomplished in a number ofways. For example, if an amount of a portion of a gene or its specificRNA, or a fragment thereof, is available and can be purified andlabeled, the generated DNA fragments may be screened by nucleic acidhybridization to the labeled probe [Benton and Davis, Science 196:180(1977); Grunstein and Hogness, Proc. Natl. Acad. Sci. U.S.A. 72:3961(1975)]. Alternatively, the presence of the gene may be detected byassays based on the physical, chemical, or immunological properties ofits expressed product. For example, cDNA clones, or DNA clones whichhybrid-select the proper mRNAs, can be selected which produce a proteinthat, e.g., has similar or identical electrophoretic migration,isoelectric focusing behavior, proteolytic digestion maps, or antigenicproperties as known for the protein of interest

A nucleic acid sequence encoding a hybrid enzyme comprising atransmembrane portion of a first enzyme and a catalytic portion of asecond enzyme may be obtained as follows. The sequence encoding thetransmembrane portion is removed from the second enzyme, leaving anucleic acid sequence comprising a nucleic acid sequence encoding theC-terminal portion of the second enzyme, which encompasses the catalyticsite. The sequence encoding the transmembrane portion of the firstenzyme is isolated or obtained via PCR and ligated to the sequenceencoding a sequence comprising the C-terminal portion of the secondenzyme.

Modified Enzymes

A nucleic acid sequence encoding a protein, particularly enzymes such asgalactosyltransferases, mannosidases and N-acetylglucosaminetransferases that are retained in the ER may be obtained by removing thesequence encoding the transmembrane fragment and substituting it for amethionine (initiation of translation) codon and by inserting betweenthe last codon and the stop codon of galactosyltransferase the nucleicacid sequence encoding an ER retention signal such as the sequenceencoding KDEL (amino acid residue sequence: lysine-asparticacid-glutamic acid-leucine) [Rothman Cell 50:521 (1987)].

Using Domains and Portions Thereof

As noted above, the phrases “at least a portion of ” or a “fragment of”refers to the minimal amino acid sequence necessary for a protein or apeptide to retain its natural or native function. For example, thefunction of an enzyme could refer to its enzymatic or catalytic role,its ability to anchor a protein in the Golgi apparatus, or as a signalpeptide. Thus, the phrases “at least a portion of a transmembranedomain” or “a fragment of a transmembrane domain” each refer to thesmallest amino acid segment of a larger transmembrane domain that stillretains at least part of the native transmembrane functionality (forexample, the function may be evident, albeit decreased). As anotherexample, the phrases “at least a portion of a catalytic region” or “afragment of a catalytic region” each refer to the smallest amino acidsegment of a larger catalytic region that still retains at least part ofthe native catalytic functionality (again, even if somewhat decreased).As discussed herein, one skilled in the art will know the minimal aminoacid segment that is necessary for a protein or a peptide to retain atleast some of the functionality of the native protein or peptide.

The glycosyltransferase enzymes are typically grouped into familiesbased on the type of sugar they transfer (galactosyltransferases,sialyltransferases, etc.). Based on amino-acid sequence similarity andthe stereochemical course of the reaction, glycosyltransferases can beclassified into at least 27 and perhaps as many as 47 different families[Campbell et al., Biochem. J. 326:929-939 (1997), Biochem. J. 329:719(1998)]. The majority of glycosyltransferases cloned to date are type IItransmembrane proteins (i.e., single transmembrane domain with the NH₂terminus in the cytosol and the COOH terminus in the lumen of the Golgiapparatus). Regardless of how they are classified, allglycosyltransferases share some common structural features: a shortNH₂-terminal cytoplasmic tail, a 16-20 amino acid signal-anchor ortransmembrane domain, and an extended stem region which is followed bythe large COOH-terminal catalytic domain. The cytoplasmic tail appearsto be involved in the specific localization of some types ofglycosyltransferases to the Golgi [Milland et al., J. Biol. Chem.277:10374-10378]. The signal anchor domains can act as both uncleavablesignal peptides and as membrane-spanning regions that orient thecatalytic domains of the glycosyltransferases within the lumen of theGolgi apparatus.

In one embodiment of the present invention, a portion defined by theN-terminal 77 amino acids of Nicotiana benthamiana (tobacco)acetylglucosaminyltransferase I are contemplated for use in the hybridenzyme(s), since this portion has been found to be sufficient to targetto and to retain a reporter protein in the plant Golgi apparatus [Esslet al., FEBS Lett 453:169-173 (1999)]. Subcellular localization intobacco of various fusion proteins between the putative cytoplasmic,transmembrane and stem domains revealed that thecytoplasmic-transmembrane domains alone were sufficient to sustain Golgiretention of β1,2-xylosyltransferase without the contribution of anyluminal sequences [Dimberger et al., Plant Mol. Biol. 50:273-281(2002)]. Thus, as noted above, certain embodiments of the presentinvention utilize portions of the CTS region which involve only thecytoplasmic-transmembrane domains (or portions thereof) withoututilizing the stem region of the CTS region. However, while some typesof glycosyltransferases rely primarily on their transmembrane domain forGolgi retention, other types require their transmembrane region andsequences flanking one or both sides of this region [Colley,Glycobiology 7:1-13 (1997)]. For example, the N-terminal peptideencompassing amino acids 1 to 32 appears to be the minimal targetingsignal sufficient to localize β1,6 N-acetylglucosaminyltransferase tothe Golgi. This peptide makes up the cytoplasmic and transmembranedomains of this enzyme [Zerfaoui et al., Glycobiology 12:15-24].

A great deal of information is available on the amino acid sequences ofthe domains for specific glycosyltransferases. For example, the aminoacid sequence of the mammalian galactosyltransferase provided in GenBankAccession No. AAM17731 has the “stem” and “catalytic” domains spanningresidues 19 to 147 and residues 148 to 397, respectively [U.S. Pat. No.6,416,988, hereby incorporated by reference]—and the present invention,in certain embodiments, specifically contemplates such portions for usein the hybrid enzyme(s). The amino acid sequence of the rat liversialyltransferase provided in GenBank Accession No. AAC91156 has a9-amino acid NH₂-terminal cytoplasmic tail, a 17-amino acidsignal-anchor domain, and a luminal domain that includes an exposed stemregion followed by a 41 kDa catalytic domain [Hudgin et al., Can. J.Biochem. 49:829-837 (1971); U.S. Pat. Nos. 5,032,519 and 5,776,772,hereby incorporated by reference]. Known human and mouseβ1,3-galactosyltransferases have a catalytic domain with eight conservedregions [Kolbinger et al., J. Biol. Chem. 273:433-440 (1998); Hennet etal., J. Biol. Chem. 273:58-65 (1998); U.S. Pat. No. 5,955,282, herebyincorporated by reference]. For example, the amino acid sequence ofmouse UDP-galactose: β-N-acetylglucosamine β1,3-galactosyltransferase-Iprovided in GenBank Accession No. NM020026 has the following catalyticregions: region 1 from residues 78-83; region 2 from residues 93-102;region 3 from residues 116-119; region 4 from residues 147-158; region 5from residues 172-183; region 6 from residues 203-206; region 7 fromamino acid residues 236-246; and region 8 from residues 264-275. [Hennetet al., supra.]—all of which are contemplated in certain embodiments ofthe present invention as useful portions in the context of the hybridenzyme(s) discussed above.

While earlier comparisons amongst known cDNA clones ofglycosyltransferases had revealed very little sequence homology betweenthe enzymes [Paulson et al., J. Biol. Chem. 264:17615-618 (1989)], morerecent advances have made it possible to deduce conserved domainstructures in glycosyltransferases of diverse specificity [Kapitonov etal., Glycobiology 9:961-978 (1999)]. For example, the nucleic acid andamino acid sequences of a number of glycosyltransferases have beenidentified using sequence data provided by the complete genomicsequences obtained for such diverse organisms as Homo sapiens (humans),Caenorhabditis elegans (soil nematode), Arabidopsis thaliana (thalecress, a mustard) and Oryza sativa (rice).

As a result of extensive studies, common amino acid sequences have beendeduced for homologous binding sites of various families ofglycosyltransferases. For example, sialyltransferases have sialyl motifsthat appear to participate in the recognition of the donor substrate,CMP-sialic acid [Paulson et al., J. Biol. Chem., 264:17615-17618 (1989);Datta et al., J. Biol. Chem., 270:1497-1500 (1995); Katsutoshi, TrendsGlycosci. Glycotech. 8:195-215 (1996)]. The hexapeptide RDKKND in Galα1-3 galactosyltransferase and RDKKNE in GlcNAc β1-4galactosyltransferase have been suggested as the binding site forUDP-Gal [(Joziasse et al., J. Biol. Chem., 260:4941-4951 (1985), J.Biol. Chem., 264:14290-14297 (1989); Joziasse, Glycobiology, 2:271-277(1992)].

A small, highly-conserved motif formed by two aspartic acid residues(D×D), which is frequently surrounded by a hydrophobic region, has beenidentified in a large number of different eukaryotic transferases,including α-1,3-mannosyltransferase, β-1,4-galactosyltransfereases,α-1,3-galactosyltransferases, glucuronyltransferases,fucosyltransferases, glycogenins and others [Wiggins et al., Proc. Natl.Acad. Sci. U.S.A. 95:7945-7950 (1998)]. Mutation studies indicate thatthis motif is necessary for enzymatic activity [Busch et al., J. Biol.Chem. 273:19566-19572 (1998); Wang et al., J. Biol. Chem.277:18568-18573 (2002)]. Multiple peptide alignment showed severalmotifs corresponding to putative catalytic domains that are conservedthroughout all members of the β3-galactosyltransferase family, namely, atype II transmembrane domain, a conserved D×D motif, an N-glycosylationsite and five conserved cysteines [Gromova et al., Mol. Carcinog.32:61-72 (2001)].

Through the use of BLAST searches and multiple alignments, the E-X₇-Emotif was found to be a highly conserved among the members of fourfamilies of retaining glycosyltransferases [Cid et al., J. Biol. Chem.275:33614-33621 (2000)]. The O-linked acetylglucosaminyltransferases(GlcNAc) add a single β-N-acetylglucosamine moiety to specific serine orthreonine hydroxyls. BLAST analyses, consensus secondary structurepredictions and fold recognition studies indicate that a conserved motifin the second Rossmann domain points to the UDP-GlcNAc donor-bindingsite [Wrabl et al., J. Mol. Biol. 314:365-374 (2001)]. Theβ1,3-glycosyltransferase enzymes identified to date share severalconserved regions and conserved cysteine residues, all being located inthe putative catalytic domain. Site-directed mutagenesis of the murineβ3GatT-I gene (Accession No. AF029790) indicate that the conservedresidues W101 and W162 are involved in the binding of the UDP-galactosedonor, the residue W315 in the binding of theN-acetylglucosamine-β-p-nitrophenol acceptor, and the domain includingE264 appears to participate in the binding of both substrates [Malissardet al., Eur. J. Biochem. 269:233-239 (2002)].

Expression of Proteins of Interest in Plant Host System

The nucleic acid encoding the hybrid or modified enzymes or otherheterologous proteins, such as a heterologous glycoprotein may beinserted according to certain embodiments of the present invention intoan appropriate expression vector, i.e., a vector which contains thenecessary elements for the transcription and translation of the insertedcoding sequence, or in the case of an RNA viral vector, the necessaryelements for replication and translation, as well as selectable markers.These include but are not limited to a promoter region, a signalsequence, 5′ untranslated sequences, initiation codon (depending uponwhether or not the structural gene comes equipped with one), andtranscription and translation termination sequences. Methods forobtaining such vectors are known in the art (see WO 01/29242 forreview).

Promoter sequences suitable for expression in plants are described inthe art, e.g., WO 91/198696. These include non-constitutive promoters orconstitutive promoters , such as, the nopaline synthetase and octopinesynthetase promoters, cauliflower mosaic virus (CaMV) 19S and 35Spromoters and the figwort mosaic virus (FMV) 35 promoter (see U.S. Pat.Nos. 5,352,605 and 6,051,753, both of which are hereby incorporated byreference). Promoters used may also be tissue specific promoterstargeted for example to the endosperm, aleurone layer, embryo, pericarp,stem, leaves, tubers, roots, and the like.

A signal sequence allows processing and translocation of a protein whereappropriate. The signal can be derived from plants or could be non-plantsignal sequences. The signal peptides direct the nascent polypeptide tothe endoplasmic reticulum, where the polypeptide subsequently undergoespost-translational modification. Signal peptides can routinely beidentified by those of skill in the art. They typically have atripartite structure, with positively charged amino acids at theN-terminal end, followed by a hydrophobic region and then the cleavagesite within a region of reduced hydrophobicity.

The transcription termination is routinely at the opposite end from thetranscription initiation regulatory region. It may be associated withthe transcriptional initiation region or from a different gene and maybe selected to enhance expression. An example is the NOS terminator fromAgrobacterium Ti plasmid and the rice alpha-amylase terminator.Polyadenylation tails may also be added. Examples include but are notlimited to Agrobacterium octopine synthetase signal, [Gielen et al.,EMBO J. 3:835-846 (1984)] or nopaline synthase of the same species[Depicker et al., Mol. Appl. Genet. 1:561-573 (1982)].

Enhancers may be included to increase and/or maximize transcription ofthe heterologous protein. These include, but are not limited to peptideexport signal sequence, codon usage, introns, polyadenylation, andtranscription termination sites ( see WO 01/29242).

Markers include preferably prokaryote selectable markers. Such markersinclude resistance toward antibiotics such as ampicillin, tetracycline,kanamycin, and spectinomycin. Specific examples include but are notlimited to streptomycin phosphotransferase (spt) gene coding forstreptomycin resistance, neomycin phosphotransferase (nptII) geneencoding kanamycin or geneticin resistance, hygromycinphosphotransferase (hpt) gene encoding resistance to hygromycin.

The vectors constructed may be introduced into the plant host systemusing procedures known in the art (reviewed in WO 01/29242 and WO01/31045). The vectors may be modified to intermediate planttransformation plasmids that contain a region of homology to anAgrobacterium tumefaciens vector, a T-DNA border region from A.tumefaciens. Alternatively, the vectors used in the methods of thepresent invention may be Agrobacterium vectors. Methods for introducingthe vectors include but are not limited to microinjection, velocityballistic penetration by small particles with the nucleic acid eitherwithin the matrix of small beads or particles, or on the surface andelectroporation. The vector may be introduced into a plant cell, tissueor organ. In a specific embodiment, once the presence of a heterologousgene is ascertained, a plant may be regenerated using procedures knownin the art. The presence of desired proteins may be screened usingmethods known in the art, preferably using screening assays where thebiologically active site is detected in such a way as to produce adetectable signal. This signal may be produced directly or indirectly.Examples of such assays include ELISA or a radioimmunoassay.

Transient Expression

The present invention specifically contemplates both stable andtransient expression of the above-described hybrid enzymes. Techniquesfor transforming a wide variety of higher plant species for transientexpression of an expression cassette are well known [see, for example,Weising et al., Ann. Rev. Genet. 22:421-477(1988)]. Variables ofdifferent systems include type nucleic acid transferred (DNA, RNA,plasmid, viral), type of tissue transformed, means of introducingtransgene(s), and conditions of transformation. For example, a nucleicacid construct may be introduced directly into a plant cell usingtechniques ranging from electroporation, PEG poration, particlebombardment, silicon fiber delivery, microinjection of plant cellprotoplasts or embryogenic callus or other plant tissue, orAgrobacterium-mediated transformation [Hiei et al., Plant J. 6:271-282(1994)]. Because transformation efficiencies are variable, internalstandards (eg, 35S-Luc) are often used to standardize transformationefficiencies.

Expression constructs for transient assays include plasmids and viralvectors. A variety of plant viruses that can be employed as vectors areknown in the art and include cauliflower mosaic virus (CaMV),geminivirus, brome mosaic virus, and tobacco mosaic virus.

Plant tissues suitable for transient expression include cultured cells,either intact or as protoplasts (in which the cell wall is removed),cultured tissue, cultured plants, and plant tissue such as leaves.

Some transient expression methods utilize gene transfer into plant cellprotoplasts mediated by electroporation or polyethylene glycol (PEG).These methods require the preparation and culture of plant protoplasts,and involve creating pores in the protoplast through which nucleic acidis transferred into the interior of the protoplast.

Exemplary electroporation techniques are described in Fromm et al, Proc.Natl. Acad. Sci. 82: 5824 (1985). The introduction of DNA constructsusing polyethylene glycol precipitation is described in Paszkowski etal., EMBO J. 3: 2717-2722 (1984). PEG-mediated transformation of tobaccoprotoplasts, which includes the steps of isolation, purification, andtransformation of the protoplasts, are described in Lyck et al., (1997)Planta 202: 117-125 and Scharf et al., (1998) Mol Cell Biol 18:2240-2251, and Kirschner et al., (2000) The Plant J 24(3): 397-411.These methods have been used, for example, to identify cis-actingelements in promoters activated by external stimuli, Abel and Theologis(1994) Plant J 5: 421-427; Hattori et al., (1992) Genes Dev 6: 609-618;Sablowski et al., (1994) EMBO J 13: 128-137; and Solano et al., (1995)EMBO J 14: 1773-1784), as well as for other gene expression studies(U.S. Pat. No. 6,376,747, hereby incorporated by reference).

Ballistic transformation techniques are described in Klein et al.,(1987) Nature 327: 70-73. Biolistic transient transformation is usedwith suspension cells or plant organs. For example, it has beendeveloped for use in Nicotiana tabacum leaves, Godon et al (1993)Biochimie 75(7): 591-595. It has also been used in investigating plantpromoters, (Baum et al., (1997) Plant J 12: 463-469; Stromvik et al.,(1999) Plant Mol Biol 41(2): 217-31, Tuerck and Fromm (1994) Plant Cell6: 1655-1663; and U.S. Pat. No. 5,847,102, hereby incorporated byreference), and to characterize transcription factors (Goff et al.,(1990) EMBO J 9: 2517-2522; Gubler et al., (1999) Plant J 17: 1-9; andSainz et al., (1997) Plant Cell 9: 611-625).

Other methods allow visualization of transient expression of genes insitu, such as with onion epidermal peels, in which GFP expression invarious cellular compartments was observed (Scott et al., (1999)Biotechniques 26(6): 1128-1132

Nucleic acids can also be introduced into plants by direct injection.Transient gene expression can be obtained by injection of the DNA intoreproductive organs of a plant (see, for example, Pena et al., (1987)Nature, 325.:274), such as by direct DNA transfer into pollen (see, forexample, Zhou et al., (1983) Methods in Enzymology, 101:433; D. Hess(1987) Intern Rev. Cytol., 107:367; Luo et al., (1988) Plant Mol. Biol.Reporter; 6:165. DNA can also be injected directly into the cells ofimmature embryos (see, for example, Neuhaus et al., (1987) Theor. Appl.Genet: 75:30; and Benbrook et al., (1986) in Proceedings Bio Expo 1986,Butterworth, Stoneham, Mass., pp. 27-54).

Agrobacterium-mediated transformation is applicable to both dicots andmonocots. Optimized methods and vectors for Agrobacterium-mediatedtransformation of plants in the family Graminae, such as rice and maizehave been described (see, for example, Heath et al., (1997) Mol.Plant-Microbe Interact. 10:221-227; Hiei et al., (1994) Plant J.6:271-282 and Ishida et al., (1996) Nat. Biotech. 14:745-750). Theefficiency of maize transformation is affected by a variety of factorsincluding the types and stages of tissue infected, the concentration ofAgrobacterium, the tissue culture media, the Ti vectors and the maizegenotype.

Another useful basic transformation protocol involves a combination ofwounding by particle bombardment, followed by use of Agrobacterium forDNA delivery (see, for example, Bidney et al., (1992) Plant Mol. Biol.18:301-313). Both intact meristem transformation and a split meristemtransformation methods are also known (U.S. Pat. No. 6,300,545, herebyincorporated by reference).

Additional methods utilizing Agrobacteria include agroinfection andagroinfiltration. By inserting a viral genome into the T-DNA,Agrobacterium can be used to mediate the viral infection of plants (see,for example, U.S. Pat. No. 6,300,545, hereby incorporated by reference).Following transfer of the T-DNA to the plant cell, excision of the viralgenome from the T-DNA (mobilization) is required for successful viralinfection. This Agrobacterium-mediated method for introducing a virusinto a plant host is known as agroinfection (see, for example, Grimsley,“Agroinfection” pp. 325-342, in Methods in Molecular Biology, vol 44:Agrobacterium Protocols, ed. Gartland and Davey, Humana Press, Inc.,Totowa, N.J.; and Grimsley (1990) Physiol. Plant. 79:147-153).

The development of plant virus gene vectors for expression of foreigngenes in plants provides a means to provide high levels of geneexpression within a short time. Suitable viral replicons includedouble-stranded DNA from a virus having a double stranded DNA genome orreplication intermediate. The excised viral DNA is capable of acting asa replicon or replication intermediate, either independently, or withfactors supplied in trans. The viral DNA may or may not encodeinfectious viral particles and furthermore may contain insertions,deletions, substitutions, rearrangements or other modifications. Theviral DNA may contain heterologous DNA, which is any non-viral DNA orDNA from a different virus. For example, the heterologous DNA maycomprise an expression cassette for a protein or RNA of interest.

Super binary vectors carrying the vir genes of Agrobacterium strainsA281 and A348 are useful for high efficiency transformation of monocots.However, even without the use of high efficiency vectors, it has beendemonstrated that T-DNA is transferred to maize at an efficiency thatresults in systemic infection by viruses introduced by agroinfection,although tumors are not formed (Grimsley et al., (1989) Mol. Gen. Genet.217:309-316). This is because integration of the T-DNA containing theviral genome is not required for viral multiplication, since the excisedviral genome acts as an independent replicon.

Another Agrobacteria-mediated transient expression assay is based onAgrobacterium-mediated transformation of tobacco leaves in planta (Yanget al., (2000) The Plant J 22(6): 543-551). The method utilizesinfiltration of agrobacteria carrying plasmid constructs into tobaccoleaves, and is referred to as agroinfiltration; it has been utilizedused to analyze in vivo expression of promoters and transcriptionfactors in as little as 2-3 days. It also allows examination of effectsof external stimuli such as pathogen infections and environmentalstresses on promoter activity in situ.

EXAMPLE 1

An Arabidopsis thaliana cDNA encoding β1,2-xylosyltransferase wasisolated from a cDNA library by a previously described PCR based siblingselection procedure [Bakker et al., BBRC 261:829 (1999)].Xylosyltransferase activity was confirmed by immunostaining oftransfected CHO cells with a xylose specific antibody purified fromrabbit-anti-horseradish-peroxidase antiserum. A DNA fragment coveringthe N-terminal part of the xylosyltransferase was amplified usingprimers: XylTpvuF: ATACTCGAGTTAACAATGAGTAAACGGAATC (SEQ ID NO: 45) andXylTpvuR: TTCTCGATCGCCGATTGGTTATTC (SEQ ID NO: 46)XhoI and HpaI restriction sites were introduced in front of the startcodon and a PvuI was introduced at the reverse end. A C-terminalfragment from Human β1,4galactosyltransferase (acc.no. x55415, Aoki1992) was amplified using primers GalTpvuF:GCCGCCGCGATCGGGCAGTCCTCC (SEQID NO:47) and GalTrev:AACGGATCCACGCTAGCTCGGTGTCCCGAT (SEQ ID NO:48) thusintroducing PvuI and BamHI sites. The XhoI/PvuI and PvuI/BamHI digestedPCR fragments were ligated in XhoI/BamHI digested pBluescriptSK+ andsequenced. The resulting open reading frame encodes a fusion proteincontaining the first 54 amino acids of A. thalianaβ1,2-xylosyltransferase fused with amino acid 69 to 398 of humanβ1,4galactosyltransferase and is designated as TmXyl-GalT. The fragmentwas cloned into a plant expression vector between the CaMV35S promoterand Nos terminator, using HpaI/BamHI. The clone was introduced intoNicotiana tabacum (samsun NN) as described for native humanβ1,4galactosyltransferase [Bakker et al., Proc. Nat. Acad. Sci. USA98:2899 (2001)].

Protein extract of transgenic plants and Western Blots were made asdescribed [Bakker et al., Proc. Nat. Acad. Sci. USA 98:2899 (2001)].Based on reaction with the lectin RCA, a transgenic plant expressingTmXylGalT was selected for further glycan analysis by MALDI-TOF [Elberset al., Plant Physiology 126:1314 (2001] and compared with glycansisolated from plants expressing native β1,4galactosyltransferase andwith glycans from wild-type plants. Relative peak areas of the MALDI-TOFspectrum are given in Table 1. That is to say, Table 1 is a comparisonof the results of mass spec (MALDI-TOF) analysis of N-glycans ofendogenous glycoproteins of control tobacco (“Tobacco”), transgenictobacco expressing human beta-1,4-galactosyltransferase (“GalT”) andtransgenic tobacco plants expressing the beta-1,4-galactosyltransferasegene of which the CTS region has been replaced with that ofbeta-1,2-xylosyltransferase (“TmXyl-GalT”). TABLE 1 m/z Type TobaccoGalT TmXyl-GalT  933 M3 3 7 1065 XM3 10 16 3 1079 FM3 4 1095 M4 9 1211FXM3 41 27 1257 M5 4 5 23 1268 GNXM3 4 1298 GalGNM3 6 1298 GNM4 1414GNFXM3 27 13 5 1419 M6 7 8 10 1460 GalGNM4 11 1460 GNM5 1485 GN2FM3 41576 GalGNFXM3 5 1576 GNFXM4 1581 M7 3 4 1606 GNFM5 3 1606 GalGNFM4 1617GN2FXM3 8 9 1622 GalGNM5 9 1622 GNM6 1743 M8 2 3 1768 GalGNFM5 3 1768GNFM6 1779 GalGN2FXM3 2 1905 M9 1 1941 Gal2GN2FXM3 2 TOTAL 100 100 101These data show that:1. In TmXylGalT plants, xylosylation and fucosylation of the glycans isdramatically reduced: 82% of the glycans do not carry xylose nor fucoseas compared to 14% in wild-type plants.2. Galactosylation has increased from 9% in GalT plants to 32% inTmXylGalT plants.

EXAMPLE 2

A transgenic plant expressing said TmXyl-GalT gene (TmXyl-GalT-12 plant)was selected (above) based on lectin blotting using biotin-labelled RCA(Vector Laboratories, Burlingame, Calif.). Comparison of proteinextracts of MGR48 transgenic (control) plant, a selected transgenicplant expressing the unmodified human β1,4-galactosyltransferase geneand TmXyl-GalT-12 plant for the presence of xylose and fucose usinganti-HRP (horseradish peroxidase) polyclonal antibody (known for highanti-xylose and anti-fucose reactivity) clearly showed reduced xyloseand fucose (FIG. 34: “Anti-HRP”). Western blotting using an anti-xylosefraction of the anti-HRP and an anti-fucose fraction (each of which canbe prepared by affinity chromatography over the appropriate ligand)showed that especially xylose was reduced compared to control plants(FIG. 34: anti-Fuc” and “anti-Xyl”).

EXAMPLE 3

The TmXyl-GalT-12 plant was crossed with a transgenic plant expressingthe monoclonal antibody MGR48 from a single T-DNA integration event(MGR48-31) and which was first made homozygous by selecting offspringplants not segregating for the kanamycin resistance marker and antibodyproduction (MGR48-31-4). Pollen of MGR48-31-4 was used for pollinationof emasculated TmXyl-GalT-12 plants. Vice versa, pollen of TmXyl-GalT-12plant was used for fertilization on emasculated MGR48-31-4 plants. Anumber of F1 plants were analyzed for the presence of MGR48 by westernblotting and for galactosylation of endogenous glycoproteins by lectinblotting using RCA (FIG. 35). One plant expressing MGR48 and showinggalactosylation of endogenous glycoproteins was selected for furtheranalysis. This plant was identified as XGM8.

Seeds from TmXyl-GalT-12 (♀)×MGR48-31-4 (♂) were sown and F1 offspringplants (XGM) were analysed for antibody production by Western blottingand for galactosylation by lectin blotting using biotinylated RCA120(Vector Labs., Burlingame, Calif.) using standard techniques asdescribed before. All plants as expected expressed the monoclonalantibody MGR48 and the majority also had galactosylated glycans asdepicted from lectin blotting using RCA120. A single plant expressingboth antibody MGR48 and having galactosylated N-glycans was chosen forfurther analysis (XGM8) (TmXyl-GalT-12× MGR48-31-4 offpring plant 8).The monoclonal recombinant MGR48 antibody was purified from this plantas described before and submitted to N-glycan analysis by MALDI-TOF.

Briefly, XGM8 plant was grown in greenhouse for antibody productionunder optimal conditions [Elbers et al., Plant Physiology 126:1314(2001)]. Protein extract of leaves of transgenic XGM8 plant was made andmonoclonal antibody was purified using protein G chromatography asdescribed [Bakker et al, Proc. Nat. Acad. Sci. USA 98:2899 (2001)].MALDI-TOF of N-glycans of purified monoclonal antibody was as described(Elbers et al., 2001, supra). The presence of galactose on glycans wasestablished by enzyme sequencing using bovine testis β-galactosidase asdescribed (Bakker et al., 2001, supra; Table 2). Table 2 (below) is acomparison of the results of mass spec (MALDI-TOF) analysis of N-glycansof endogenous glycoproteins (“Xyl-GalT Endo”) of a F1 hybrid ofTmXyl-GalT-12plant and plant producing rec-mAb (MGR48) and of N-glycansof rec-mAB purified by protein G chromatography from said P1 hybrid.TABLE 2 Xyl-GalT Xyl-GalT m/z Type Endo IgG 933 M3 6 4 1065 XM3 2 2 1079FM3 2 3 1095 M4 5 5 1136 GNM3 1 2 1211 FXM3 6 3 1241 FM4 3 2 1257 M5 1712 1268 GNXM3 1 2 1282 GNFM3 2 3 1298 GalGNM3 3 4 1403 FM5 4 3 1414GNFXM3 2 4 1419 M6 5 4 1430 GNXM4 2 2 1430 GalGNXM3 1444 GNFM4 1 3 1444GalGNFM3 1460 GalGNM4 8 10 1460 GNM5 1471 GN2XM3 1 1485 GN2FM3 1 1 1501GalGN2M3 1 1 1576 GalGNFXM3 2 3 1576 GNFXM4 1581 M7 2 2 1593 GalGNXM4 12 1593 GNXM5 1606 GNFM5 3 4 1606 GalGNFM4 1617 GN2FXM3 2 1 1622 GalGNM56 6 1622 GNM6 1647 GalGN2FM3 1 1 1663 Gal2GN2M3 1 1 1738 GNFXM5 1 2 1738GalGNFXM4 1743 M8 1 2 1754 GalGNXM5 1 2 1768 GalGNFM5 2 3 1768 GNFM61784 GNM7 1 1 1784 GalGNM6 1809 Gal2GN2FM3 2 1 1900 GNFXM6 1 1900GalGNFXM5 1905 M9 1 1 TOTAL 101 102These data show that:1. In the F1 hybrid, xylosylation and fucosylation of the glycans isdramatically reduced: 43% of the glycans of endogenous glycoproteinslack xylose and fucose as compared to only 14% in wild-type tobaccoplants.2. The glycans of purified mAb of this F1 hybrid have reduced xylose andfucose, 47% compared to 14% for wildtype tobacco. See also FIG. 36,panels B-D.3. Galactosylation of endogenous glycoproteins of F1 hybrid hasincreased from 9% in GalT plants to 37% in F1 TmXyl-GalT X MGR48 plant.See also FIG. 35.4. Purified rec-mAb from said F1 (see FIG. 36, panel A) shows increasedgalactosylation; that is to say, 46% has galactose. See also FIG. 36,panel E.It should however be noted that the observed quantities (MALDI-TOF) donot necessarily reflect the molar rations of said glycoforms in vivo.Quantification based on MALDI-TOF can be under-or overestimateddepending on the specific glycoform under study. Also, since there is nomolecular# weight difference between Gal and Man, some peaks can not be annotatedunambiguously unless there are clear differences in relative height ofspecific molecules before and after galactosidase treatment.

EXAMPLE 4

A more direct comparison of xylose, fucose and galactose content wasdone by examining the MGR48 IgG antibodies from hybridoma, transgenictobacco and TmXyl-GalT transgenic tobacco. As mentioned above, theTmXyl-GalT-12 plant was crossed with tobacco plant expressing MGR48 IgG(MGR48 tobacco) resulting in an F1 hybrid harbouring MGR48 TmXyl-GalT.An F1 plant was chosen for extraction and purification of MGR48 IgG.Antibodies from said plants (tobacco and TmXyl-GalT) were isolated andpurified using protein G chromatography (Elbers et al., 2001. PlantPhysiology 126: 1314-1322). 300 nanograms amounts of each, hybridomaMGR48 and plant-derived recMGR48, were loaded on precast 12% SDS-PAGEgels (BioRad) and run. The contents of each lane were as follows: Lane1, MGR48 from hybridoma; Lane 2, purified recMGR48 from normaltransgenic tobacco plant; and Lane 3, purified recMGR48 from TmXyl-GalTtransgenic plant. Following SDS-PAGE proteins were transferred tonitrocellulose using CAPS buffer. Blots were incubated with A,anti-mouse IgG; B, polyclonal rabbit anti-HRP (anti-xylose/(alpha1,3-fucose); C, anti-xylose; D, anti-(alpha 1,3-) fucose antibodies; andE, biotinylated RCA. Detection was with LumiLight on Lumi Imagerfollowing incubation with HRP-labelled sheep anti-mouse (panel A) orgoat-anti-rabbit (panels B-D) antibodies and HRP-labeled streptavidin(E).

Panel A shows that approximately similar amounts of the MGR48 IgG wasloaded for all lanes (1-3). L refers to Light chain and H, heavy chainof MGR48 IgG.

Panel B shows that the heavy chain of MGR48 antibody in lane 2 (tobacco)strongly reacts with anti-HRP as expected, whereas the heavy chain ofhybridoma derived MGR48 (lane 1) does not (as expected). Hybridomaderived antibodies do not carry xylose and alpha 1,3-fuctose residues.Remarkably, MGR48 antibodies from TmXyl-GalT tobacco plant also do notreact, suggesting that the heavy chain of antibody from this plant havesignificantly reduced (perhaps by 90% or more) the amounts of xylose andfucose residues on the N-glycans. This is confirmed by experimentsdepicted in panels C (anti-xylose) and D (anti-fucose). Panel E showsthat the heavy chain of MGR48 antibody of hybridoma (lane 1) has agalactosylated N-glycan, whereas tobacco-derived MGR48 (lane 2) has not,both as expected. Heavy chain of MGR48 from the TmXyl-GalT plant (lane3) also has galactosylated N-glycan due to the presence of the constructexpressing the hybrid enzyme.

These data are in agreement with the data obtained from similarexperiments using total protein extracts from similar plants (tobaccoand TmXyl-GalT-12 plant) as shown previously and confirm that the noveltrait introduced in tobacco from expression of TmXyl-GalT gene can bestably transmitted to offspring and a recombinant monoclonal antibody.

EXAMPLE 5

Further characterization of the above-described F1 hybrid was performedby treatement with beta-galactosidase. Table 3 is a comparison of theresults of mass spec (MALDI-TOF) analysis of N-glycans of rec-mAbspurified by protein G chromatography from an F1 hybrid of TmXyl-GalT andMGR48 plant before and after treatment of the glycans withbeta-galactosidase. TABLE 3 Xyl-GalT Xyl-GalT m/z Type IgG- IgG +beta-galactosidase 933 M3 4 4 1065 XM3 2 2 1079 FM3 3 3 1095 M4 5 4 1136GNM3 2 3 1211 FXM3 3 4 1241 FM4 2 2 1257 M5 12 13 1268 GNXM3 2 3 1282GNFM3 3 3 1298 GalGNM3 4 4 1403 FM5 3 2 1414 GNFXM3 4 5 1419 M6 4 3 1430GNXM4 2 2 1430 GalGNXM3 1444 GNFM4 3 3 1444 GalGNFM3 1460 GalGNM4 10 141460 GNM5 1471 GN2XM3 1 1485 GN2FM3 1 1 1501 GalGN2M3 1 1576 GalGNFXM3 33 1576 GNFXM4 1581 M7 2 2 1593 GalGNXM4 2 2 1593 GNXM5 1606 GNFM5 4 61606 GalGNFM4 1617 GN2FXM3 1 1 1622 GalGNM5 6 1 1622 GNM6 1647 GalGN2FM31 1663 Gal2GN2M3 1 1738 GNFXM5 2 2 1738 GalGNFXM4 1743 M8 2 2 1754GalGNXM5 2 1 1768 GalGNFM5 3 1 1768 GNFM6 1784 GNM7 1 1 1784 GalGNM61809 Gal2GN2FM3 1 1900 GNFXM6 1 1900 GalGNFXM5 1905 M9 1 1 TOTAL 102 100These data show that:1. Rec-mAbs from F1 hybrid contain galactose which can be deduced fromthe observed reduction of specific (galactose-containing) glycoformsafter beta-galactosidase treatment and increase of glycoforms lackinggalactose. Note the observed reduction of m/z 1622 from 6 to 1% and# simultaneous increase of m/z 1460 from 10 to 14% which is the resultof the removal of galactose from GalGNM5 to give rise to GNM5. The sameis true for m/z 1768 (3 to 1% decrease) and corresponding m/z 1606 peak(4 to 6% increase). See also FIG. 36, panel E.2. Similarly a number of peaks that can be attributed to galactosecontaining glycans vanish upon treatment with galactosidase, especiallym/z 1501, 1647 and 1663 confirming the presence of galactose.

EXAMPLE 6

In another embodiment, the aminoterminal CTS region of an insectMannosidase III gene (accession number: AF005034; mistakenly annotatedas a Mannosidase II gene!) is replaced by a mouse signal peptide codingsequence for import into the endoplasmic reticulum (see FIG. 37). Thesignal peptide sequence encodes a fully active signal peptide normallypresent at the aminoterminus of IgG sequences and has been usedsuccessfully in plants and other organisms before. Furthermore asynthetic sequence coding for a so-called endoplasmic reticulumretention sequence (KDEL) is added to the carboxyterminus of the genepart encoding the catalytic fragment for ER retention. The hybridMannosidase III protein encoded by this gene sequence will henceaccumulate preferentially in the endoplasmic reticulum.

EXAMPLE 7

In another embodiment, the aminoterminal CTS region of the humanbeta-1,4-galactosyltransferase (GalT) gene (accession A5255 1) isreplaced by a mouse signal peptide coding sequence for import into theendoplasmic reticulum (see FIG. 39). The signal peptide sequence encodesa fully active signal peptide normally present at the aminoterminus ofIgG sequences and has been used successfully in plants and otherorganisms before. Furthermore a synthetic sequence coding for aso-called endoplasmic reticulum retention sequence (KDEL) is added tothe carboxyterminus of the gene part encoding the catalytic fragment forER retention. The hybrid beta-1,4-galactosyl-transferase protein encodedby this gene sequence will hence accumulate preferentially in theendoplasmic reticulum.

EXAMPLE 8

In another embodiment, the aminoterminal CTS region of Arabidopsisthaliana GnTI (acc. AJ243198) is replaced by a mouse signal peptidecoding sequence for import into the endoplasmic reticulum (see FIG. 41).The signal peptide sequence encodes a fully active signal peptidenormally present at the aminoterminus of IgG sequences and has been usedsuccessfully in plants and other organisms before. Furthermore asynthetic sequence coding for a so-called endoplasmic reticulumretention sequence (KDEL) is added to the carboxyterminus of the genepart encoding the catalytic fragment for ER retention. The hybrid GnTIprotein encoded by this gene sequence will hence accumulatepreferentially in the endoplasmic reticulum.

EXAMPLE 9

In another embodiment, the aminoterminal CTS region of an Arabidopsisthaliana GnTII (acc. AJ249274) is replaced by a mouse signal peptidecoding sequence for import into the endoplasmic reticulum (see FIG. 43).The signal peptide sequence encodes a fully active signal peptidenormally present at the aminoterminus of IgG sequences and has been usedsuccessfully in plants and other organisms before. Furthermore asynthetic sequence coding for a so-called endoplasmic reticulumretention sequence (KDEL) is added to the carboxyterminus of the genepart encoding the catalytic fragment for ER retention. The hybrid GnTIIprotein encoded by this gene sequence will hence accumulatepreferentially in the endoplasmic reticulum.

EXAMPLE 10

In another embodiment, the aminoterminal CTS region of the human genefor beta-1,4-galactosyltransferase (GalT) gene is replaced by the CTSregion of the human gene for GnTI (TmhuGnTI-GalT) (see FIG. 45).

It is understood that the present invention is not limited to anyparticular mechanism. Nor is it necessary to understand the mechanism inorder to successfully use the various embodiments of the invention.Nonetheless, it is believed that there is a sequential distribution ofGolgi enzymes (FIG. 47) and that the swapping in of transmembranedomains of plant glycosyltransferases causes relocalization (FIG. 48).

It is understood that the present invention is not limited to theparticular methodology, protocols, cell lines, vectors, and reagentsdescribed herein, as these may vary. It is also to be understood thatthe terminology used herein is used for the purpose of describingparticular embodiments only, and is not intend to limit the scope of thepresent invention. It must be noted that as used herein and in theappended claims, the singular forms “a”, “an”, and “the” include pluralreference unless the context clearly dictates otherwise. Unless definedotherwise, all technical and scientific terms used herein have the samemeanings as commonly understood by one of ordinary skill in the art towhich this invention belongs.

The invention described and claimed herein is not to be limited in scopeby the specific embodiments herein disclosed, since these embodimentsare intended as illustrations of several aspects of the invention. Anyequivalent embodiments are intended to be within the scope of thisinvention. Indeed, various modifications of the invention in addition tothose shown and described herein will become apparent to those skilledin the art from the foregoing description. Such modifications are alsointended to fall within the scope of the appended claims.

Various references are cited herein, the disclosures of which areincorporated by reference in their entireties.

1. Nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprisinga transmembrane region of a plant glycosyltransferase and a catalyticregion of a mammalian glycosyltransferase.
 2. The nucleic acid of claim1, wherein said plant glycosyltransferase is a xylosyltransferase. 3.The nucleic acid of claim 1, wherein said plant glycosyltransferase is aN-acetylglucosaminyltransferase.
 4. The nucleic acid of claim 1, whereinsaid plant glycosyltransferase is a fucosyltransferase.
 5. The nucleicacid of claim 1, wherein said mammalian glycosyltransferase is a humangalactosyltransferase.
 6. The nucleic acid of claim 5, wherein saidhuman galactosyltransferase is encoded by at least a portion of thenucleic acid sequence of SEQ ID NO:1.
 7. An expression vector,comprising the nucleic acid of claim
 1. 8. A host cell transfected withthe vector of claim
 7. 9. The host cell of claim 8, wherein said hostcell is a plant cell.
 10. A cell suspension comprising the host cell ofclaim
 9. 11. The hybrid enzyme expressed by the plant cell of claim 9.12. The plant comprising the host cell of claim
 9. 13. Nucleic acidencoding a hybrid enzyme, said hybrid enzyme comprising a transmembraneregion of a first glycosyltransferase and a catalytic region of a secondglycosyltransferase.
 14. The nucleic acid of claim 13, wherein saidfirst glycosyltransferase comprises a plant glycosyltransferase
 15. Thenucleic acid of claim 14, wherein said plant glycosyltransferase is axylosyltransferase.
 16. The nucleic acid of claim 14, wherein said plantglycosyltransferase is a fucosyltransferase.
 17. The nucleic acid ofclaim 13, wherein said second glycosyltransferase comprises a mammalianglycosyltransferase.
 18. The nucleic acid of claim 17, wherein saidmammalian glycosyltransferase is a human galactosyltransferase.
 19. Thenucleic acid of claim 13, wherein said first glycosyltransferasecomprises a first mammalian glycosyltransferase and said secondglycosyltransferase comprises a second mammalian glycosyltransferase.20. The nucleic acid of claim 19, wherein said first mammalianglycosyltransferase is a non-human glycosyltransferase.
 21. The nucleicacid of claim 19, wherein said second mammalian glycosyltransferase is ahuman glycosyltransferase.
 22. A method, comprising: a. providing: i) aplant cell, and ii) an expression vector comprising nucleic acidencoding a hybrid enzyme, said hybrid enzyme comprising a transmembraneregion of a plant glycosyltransferase and a catalytic region of amammalian glycosyltransferase; and b. introducing said expression vectorinto said plant cell under conditions such that said hybrid enzyme isexpressed.
 23. The method of claim 22, wherein said plantglycosyltransferase is a xylosyltransferase.
 24. The method of claim 23,wherein said plant glycosyltransferase is aN-acetylglucosaminyltransferase.
 25. The method of claim 23, whereinsaid plant glycosyltransferase is a fucosyltransferase.
 26. The methodof claim 22, wherein said mammalian glycosyltransferase is a humangalactosyltransferase.
 27. The nucleic acid of claim 26, wherein saidhuman galactosyltransferase is encoded by at least a portion of thenucleic acid sequence of SEQ ID NO:1.
 28. A method, comprising: aproviding: i) a plant cell, ii) a first expression vector comprisingnucleic acid encoding a hybrid enzyme, said hybrid enzyme comprising atransmembrane region of a plant glycosyltransferase and a catalyticregion of a mammalian glycosyltransferase, and iii) a second expressionvector comprising nucleic acid encoding a heterologous glycoprotein; andb. introducing said first and second expression vectors into said plantcell under conditions such that said hybrid enzyme and said heterologousprotein are expressed.
 29. The method of claim 28, wherein saidheterologous protein is an antibody or antibody fragment.
 30. A method,comprising: a) providing: i) a first plant comprising a first expressionvector, said first vector comprising nucleic acid encoding a hybridenzyme, said hybrid enzyme comprising at least a portion of atransmembrane region of a plant glycosyltransferase and at least aportion of a catalytic region of a mammalian glycosyltransferase, andii) a second plant comprising a second expression vector, said secondvector comprising nucleic acid encoding a heterologous protein; and b)crossing said first plant and said second plant to produce progenyexpressing said hybrid enzyme and said heterologous protein.
 31. Aplant, comprising first and second expression vectors, said first vectorcomprising nucleic acid encoding a hybrid enzyme, said hybrid enzymecomprising at least a portion of a transmembrane region of a plantglycosyltransferase and at least a portion of a catalytic region of amammalian glycosyltransferase, said second vector comprising nucleicacid encoding a heterologous protein.
 32. The plant of claim 31, whereinsaid heterologous protein displays reduced amounts of fucose as comparedto when the heterologous protein is expressed in a plant in the absenceof said hybrid enzyme
 33. The plant of claim 31; wherein theheterologous protein displays reduced amounts of xylose as compared towhen the heterologous protein is expressed in a plant in the absence ofsaid hybrid enzyme.
 34. The plant of claim 31, wherein the heterologousprotein displays both reduced fucose and xylose, as compared to when theheterologous protein is expressed in a plant in the absence of saidhybrid enzyme.
 35. The plant of claim 31, wherein the heterologousprotein displays complex type bi-antannery glycans and containsgalactose residues on at least one of the arms.
 36. Nucleic acidencoding a hybrid enzyme, said hybrid enzyme comprising a modifiedmammalian glycosyltransferase, wherein a transmembrane portion has beendeleted and endoplasmic reticulum retention signal have been inserted.37. Nucleic acid encoding a hybrid enzyme, said hybrid enzyme comprisinga CTS region or portion thereof of a plant glycosyltransferase and acatalytic region of a mammalian glycosyltransferase, wherein said CTSregion is from a N-acetylglucosaminyltransferase I (GnTI) and saidcatalytic region is from a mannosidase II (ManII).
 38. Nucleic acidencoding a hybrid enzyme, said hybrid enzyme comprising a CTS region orportion thereof of a plant glycosyltransferase and a catalytic region ofa mammalian glycosyltransferase, wherein said CTS region or portionthereof is from a N-acetylglucosaminyltransferase I (GnTI) and saidcatalytic region is from a N-acetylglucosaminyltransferase II (GnTII).39. A plant host system, comprising (a) a nucleic acid sequence encodinga Mannosidase III glycosyltransferase; (b) a nucleic acid sequenceencoding a hybrid enzyme, said hybrid enzyme comprising a CTS region orportion thereof of a plant glycosyltransferase and a catalytic domain ofa mammalian glycosyltransferase
 40. A method, comprising (a) introducinginto said plant host system a vector comprising (i) a nucleic acidsequence encoding a hybrid enzyme comprising a catalytic region of agalactosyltransferase not normally found in a plant and a transmembraneregion of a protein, (ii) a nucleic acid sequence encoding a hybridenzyme comprising a transmembrane region of aN-acetylglucosaminyltransferase I (GnTI) and a catalytic region of amannosidase II (ManII), (iii) a nucleic acid sequence encoding a hybridenzyme comprising a transmembrane region of anN-acetylglucosaminyltransferase I (GnTI) and a catalytic region of aN-acetylglucosaminyltransferase II (GnTII); and (b) isolating a plant orportion thereof expressing said nucleic acid sequences.
 41. A method,comprising: a. providing: i) a host cell, and ii) an expression vectorcomprising nucleic acid encoding a hybrid enzyme, said hybrid enzymecomprising a transmembrane region of a first glycosyltransferase and acatalytic region of a second glycosyltransferase; and b. introducingsaid expression vector into said host cell under conditions such thatsaid hybrid enzyme is expressed.
 42. The method of claim 41, whereinsaid first glycosyltransferase comprises a plant glycosyltransferase.43. The method of claim 42, wherein said plant glycosyltransferase is axylosyltransferase.
 44. The method of claim 42, wherein said plantglycosyltransferase is a N-acetylglucosaminyltransferase.
 45. The methodof claim 42, wherein said plant glycosyltransferase is afucosyltransferase.
 46. The method of claim 41, wherein said secondglycosyltransferase comprises a mammalian glycosyltransferase.
 47. Themethod of claim 46, wherein said mammalian glycosyltransferase is ahuman galactosyltransferase.
 48. A method, comprising: a. providing: i)a host cell, ii) a first expression vector comprising nucleic acidencoding a hybrid enzyme, said hybrid enzyme comprising a transmembraneregion of a first glycosyltransferase and a catalytic region of a secondglycosyltransferase, and iii) a second expression vector comprisingnucleic acid encoding a heterologous glycoprotein; and b. introducingsaid first and second expression vectors into said host cell underconditions such that said hybrid enzyme and said heterologous proteinare expressed.
 49. The method of claim 48, wherein said heterologousprotein is an antibody or antibody fragment.
 50. The method of claim 48,further comprising the step of c) isolating said heterologous protein.51. The isolated heterologous protein produced according to the methodof claim
 50. 52. A host cell, comprising first and second expressionvectors, said first vector comprising nucleic acid encoding a hybridenzyme, said hybrid enzyme comprising at least a portion of atransmembrane region of a first glycosyltransferase and at least aportion of a catalytic region of a second glycosyltransferase, saidsecond vector comprising nucleic acid encoding a heterologous protein.53. The heterologous protein isolated from the host cell of claim 52.